Computer method and system for predicting physical properties using a conceptual segment model

ABSTRACT

Methods of conducting industrial manufacture, research or development. The method comprise computer-implemented steps of modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. The steps of determining at least one conceptual segment for each of the chemical species include defining an identity and an equivalent number of each conceptual segment.

RELATED APPLICATIONS

This application is a divisional application of U.S. application Ser. No. 11/528,749, filed Sep. 27, 2006 now U.S. Pat. No. 7,941,277, which is a continuation-in-part of U.S. application Ser. No. 10/785,925, filed Feb. 24, 2004, now U.S. Pat. No. 7,672,826, and of U.S. application Ser. No. 11/241,675, filed Sep. 30, 2005, now U.S. Pat. No. 7,809,540, which is a continuation-in-part of the U.S. application Ser. No. 10/785,925 filed Feb. 24, 2004 now U.S. Pat. No. 7,672,826. The entire teachings of the above applications are incorporated herein by reference.

BACKGROUND OF THE INVENTION

Modeling physical properties of chemical mixtures is an important task in many industries and processes. Specifically, for many processes, accurate modeling of physical properties for various mixtures is crucial for such areas as process design and process control applications. For example, modeling physical properties of chemical mixtures is often useful when selecting suitable solvents for use in chemical processes.

Solvent selection is an important task in the chemical synthesis and recipe development phase of the pharmaceutical and agricultural chemical industries. The choice of solvent can have a direct impact on reaction rates, extraction efficiency, crystallization yield and productivity, etc. Improved solvent selection brings benefits, such as faster product separation and purification, reduced solvent emission and lesser waste, lower overall costs, and improved production processes.

In choosing a solvent, various phase behavior characteristics of the solvent-solute mixtures are considered. For example, vapor-liquid equilibrium (VLE) behavior is important when accounting for the emission of solvent from reaction mixtures, and liquid-liquid miscibility (LLE) is important when a second solvent is used to extract target molecules from the reaction media. For solubility calculations, solid-liquid equilibrium (SLE) is a key property when product isolation is done through crystallization at reduced temperature or with the addition of anti-solvent.

For many applications, hundreds of typical solvents, not to mention an almost infinite number of mixtures thereof, are candidates in the solvent selection process. In most cases, there is simply insufficient phase equilibrium data on which to make an informed solvent selection. For example, in pharmaceutical applications, it is often the case that phase equilibrium data involving new drug molecules in the solvents simply do not exist. Although limited solubility experiments may be taken as part of the trial and error process, solvent selection is largely dictated by researchers' preferences or prior experiences.

Many solubility estimation techniques have been used to model the solubility of components in chemical mixtures. Some examples include the Hansen model and the UNIFAC group contribution model. Unfortunately, these models are rather inadequate because they have been developed mainly for petrochemicals with molecular weights in the 10s and the low 100s daltons. These models do not extrapolate well for chemicals with larger molecular weights, such as those encountered in pharmaceutical applications. Pharmaceuticals are mostly large, complex molecules with molecular weight in the range of about 200-600 daltons.

Perhaps, the most commonly used methods in solvent selection process are the solubility parameter models, i.e., the regular solution theory and the Hansen solubility parameter model. There are no binary parameters in these solubility parameter models and they all follow merely an empirical guide of “like dissolves like.” The regular solution model is applicable to nonpolar solutions only, but not for solutions where polar or hydrogen-bonding interactions are significant. The Hansen model extends the solubility parameter concept in terms of three partial solubility parameters to better account for polar and hydrogen-bonding effects.

In his book, Hansen published the solubility parameters for over 800 solvents. See Hansen, C. M., HANSEN, SOLUBILITY PARAMETERS: A USER'S HANDBOOK (2000). Since Hansen's book contains the parameters for most common solvents, the issue in using the Hansen model lies in the determination of the Hansen solubility parameters from regression of available solubility data for the solute of interest in the solvent selection process. Once determined, these Hansen parameters provide a basis for calculating activity coefficients and solubilities for the solute in all the other solvents in the database. For pharmaceutical process design, Bakken, et al. reported that the Hansen model can only correlate solubility data with ±200% in accuracy, and it offers little predictive capability. See Bakken, et al., Solubility Modeling in Pharmaceutical Process Design, paper presented at AspenTech User Group Meeting, New Orleans, La., Oct. 5-8, 2003, and Paris, France, Oct. 19-22, 2003.

When there are no data available, the UNIFAC functional group contribution method is sometimes used for solvent selection. In comparison to the solubility parameter models, UNIFAC's strength comes with its molecular thermodynamic foundation. It describes liquid phase nonideality of a mixture with the concept of functional groups. All molecules in the mixture are characterized with a set of pre-defined UNIFAC functional groups. The liquid phase nonideality is the result of the physical interactions between these functional groups and activity coefficients of molecules are derived from those of functional groups, i.e., functional group additivity rule. These physical interactions have been pre-determined from available phase equilibrium data of systems containing these functional groups. UNIFAC gives adequate phase equilibrium (VLE, LLE and SLE) predictions for mixtures with small nonelectrolyte molecules as long as these molecules are composed of the pre-defined set of functional groups or similar groups.

UNIFAC fails for systems with large complex molecules for which either the functional group additivity rule becomes invalid or due to undefined UNIFAC functional groups. UNIFAC is also not applicable to ionic species, an important issue for pharmaceutical processes. Another drawback with UNIFAC is that, even when valuable data become available, UNIFAC cannot be used to correlate the data. For pharmaceutical process design, Bakken et al., reported that the UNIFAC model only predicts solubilities with a RMS (root mean square) error on ln x of 2, or about ±500% in accuracy, and it offers little practical value. Id.

A need exists for new, simple, and practical methods of accurately modeling one or more physical properties of a mixture of chemicals, including electrolytes.

SUMMARY OF THE INVENTION

The present invention provides an effective tool for the correlation and prediction of physical properties of a mixtures of chemical species, including electrolytes.

In one embodiment of the present invention, the present invention features methods of conducting industrial manufacture, research or development. The method comprise computer-implemented steps of modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. The steps of determining at least one conceptual segment for each of the chemical species include defining an identity and an equivalent number of each conceptual segment.

In a first preferred embodiment, the methods further comprise steps of using the determined conceptual segment to compute at least one physical property of the mixture, and providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture.

The second preferred embodiment includes the mixtures comprising an electrolyte.

Some embodiments to the second preferred embodiment, the methods further include the steps of using the determined conceptual electrolyte segment to compute at least one physical property of the mixture, and providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture.

In one embodiment, the present invention features methods of conducting a pharmaceutical activity. The methods comprise steps of modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. The steps of determining at least one conceptual segment for each of the chemical species include defining an identity and an equivalent number of each conceptual segment.

In a preferred embodiment, the methods of conducting a pharmaceutical activity further comprise steps of using the determined conceptual segments to compute at least one physical property of the mixture, and providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture. Some embodiments to the first preferred embodiment, the mixture includes at least one liquid phase and at least one solid phase. More preferably, the liquid phase is an amorphous phase.

In another preferred embodiment, the methods of conducting a pharmaceutical activity include mixtures comprising an electrolyte. Some embodiments to the second preferred embodiment further include steps of using the determined conceptual electrolyte segment to compute at least one physical property of the mixture, and providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture.

In one embodiment, the present invention features methods of separating one or more chemical species from a mixture. The methods comprise steps of modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. The steps of determining at least one conceptual segment for each of the chemical species include defining an identity and an equivalent number of each conceptual segment. Preferably, the methods of separating one or more species from a mixture use chromatography.

In a preferred embodiment of the methods of separating one or more chemical species, the methods further comprise steps of using the determined conceptual segments to compute at least one physical property of the mixture, and providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture. In some embodiments of the first preferred embodiment, the mixture includes at least one liquid phase and at least one solid phase.

In one embodiment, the invention features computer program products. The computer program products comprise a computer usable medium, and a set of computer program instructions embodied on the computer useable medium for conducting industrial manufacture, research or development by modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. The at least one conceptual segment for each of the chemical species that includes the definition of a segment number is determined.

In yet another embodiment, the invention features a computer system for conducting industrial manufacture, research or development by modeling at least one physical property of a mixture of at least two chemical species. The computer system comprises a user input means for obtaining chemical data from a user, and a digital processor coupled to receive obtained chemical data input from the input means, and an output means coupled to the digital processor. The digital processor executes a modeling system in working memory, and the modeling system uses the chemical data to determine at least one conceptual segment for each of the chemical species. The computer system further includes the output means provides to the user the formed model of the physical property of the mixture.

Correlation and prediction of chemical properties of a mixture of chemicals play a critical role in the research, development, and manufacture of industrial processes, including pharmaceutical ones. The present invention offers a practical thermodynamic framework for modeling of complex chemical molecules, including electrolytes.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of preferred embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.

FIG. 1 is a schematic view of a computer network in which the present invention may be implemented.

FIG. 2 is a block diagram of a computer of the network of FIG. 1.

FIGS. 3-4 b are flow diagrams of one embodiment of the present invention employed in the computer network environment of FIGS. 1 and 2.

FIG. 5 illustrates a graph showing the binary phase diagram for a water, 1,4-dioxane mixture at atmospheric pressure.

FIG. 6 illustrates a graph showing the binary phase diagram for a water, octanol mixture at atmospheric pressure.

FIG. 7 illustrates a graph showing the binary phase diagram for an octanol, 1,4-dioxane mixture at atmospheric pressure.

FIG. 8 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for p-aminobenzoic acid in various solvents at 298.15K.

FIG. 9 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for benzoic acid in various solvents at 298.15K.

FIG. 10 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for camphor in various solvents at 298.15K.

FIG. 11 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for ephedrine in various solvents at 298.15K.

FIG. 12 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for lidocaine in various solvents at 298.15K.

FIG. 13 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for methylparaben in various solvents at 298.15K.

FIG. 14 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for testosterone in various solvents at 298.15K.

FIG. 15 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for theophylline in various solvents at 298.15K.

FIG. 16 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for estriol in nine solvents at 298.15K.

FIG. 17 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for estrone in various solvents at 298.15K.

FIG. 18 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for morphine in six solvents at 308.15K.

FIG. 19 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for piroxicam in 14 solvents at 298.15K.

FIG. 20 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for hydrocortisone in 11 solvents at 298.15K.

FIG. 21 illustrates a graph showing data of experimental solubilities vs. calculated solubilities for haloperidol in 13 solvents at 298.15K.

FIG. 22 is a graph illustrating the effect of hydrophobicity parameter X on natural logarithm of mean ionic activity coefficient of aqueous electrolytes with E=1.

FIG. 23 is a graph illustrating the effect of polarity parameter Y− on natural logarithm of mean ionic activity coefficient of aqueous electrolytes with E=1.

FIG. 24 is a graph illustrating the effect of polarity parameter Y+ on natural logarithm of mean ionic activity coefficient of aqueous electrolytes with E=1.

FIG. 25 is a graph illustrating the effect of hydrophilicity parameter Z on natural logarithm of mean ionic activity coefficient of aqueous electrolytes with E=1.

FIG. 26 is a graph illustrating the effect of electrolyte parameter E on natural logarithm of mean ionic activity coefficient of aqueous electrolytes.

FIG. 27 is a graph illustrating comparison of experimental and calculated molality scale mean ionic activity coefficients of representative aqueous electrolytes at 298.15 K.

FIG. 28 is a graph illustrating the present invention model results for sodium chloride solubility at 298.15 K.

FIG. 29 is a graph illustrating the present invention model results for sodium acetate solubility at 298.15 K.

FIG. 30 a is a graph illustrating the present invention model results for benzoic acid solubility at 298.15 K.

FIG. 30 b is a graph illustrating the present invention model results for sodium benzoate solubility at 298.15 K.

FIG. 31 a is a graph illustrating the present invention model results for salicylic acid solubility at 298.15 K.

FIG. 31 b is a graph illustrating the present invention model results for sodium salicylate solubility at 298.15 K.

FIG. 32 a is a graph illustrating the present invention model results for p-aminobenzoic acid solubility at 298.15 K.

FIG. 32 b is a graph illustrating the present invention model results for sodium p-aminobenzoate solubility at 298.15 K.

FIG. 33 a is a graph illustrating the present invention model results for ibuprofen solubility at 298.15 K

FIG. 33 b is a graph illustrating the present invention model results for sodium ibuprofen solubility at 298.15 K.

FIG. 34 a is a graph illustrating the present invention model results for diclofenac solubility at 298.15 K.

FIG. 34 b is a graph illustrating the present invention model results for sodium diclofenac solubility at 298.15 K.

DETAILED DESCRIPTION OF THE INVENTION

A description of example embodiments of the invention follows.

While this invention has been particularly shown and described with references to example embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.

The present invention provides a new system and method for modeling the physical properties or behavior of chemical mixtures (e.g., chemical solutions or suspensions). Briefly, the molecular structure of one or more species in a chemical mixture is assigned one or more different types of “conceptual segments.” An equivalent number is determined for each conceptual segment. This conceptual segment approach of the present invention is referred to as the Non-Random Two-Liquid Segment Activity Coefficient (“NRTL-SAC”) model for nonelectrolytes and as the electrolyte extension of NRTL-SAC (“eNRTL-SAC”) model for electrolytes.

Various NRTL models have been used to model various types of mixtures. Previous segment-based NRTL models used “segments” to define the various chemical species of a mixture. Like the UNIFAC model, these segments were based upon the actual molecular structure of the various chemical species, while the conceptual segments of the present invention are defined based upon actual thermodynamic behavior of the various chemical species.

In some embodiments, this invention features methods of conducting industrial manufacture, research or development. In one embodiment, the methods comprise computer implemented steps of modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. Determining at least one conceptual segment includes defining an identity and an equivalent number of each conceptual segment.

In one embodiment, the methods of conducting industrial manufacture, research or development further include the steps of: using the determined conceptual segments, computing at least one physical property of the mixture; and b) providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture.

In further embodiment, the method of the first embodiment that includes the mixture includes more than one phase and at least a portion of at least one chemical species is in a liquid phase. In one embodiment, the mixture includes any number and combination of vapor, solid, and liquid phase. In some embodiment, the mixture includes at least one liquid phase and at least one solid phase. In yet another embodiment, the mixture includes a first liquid phase, a second liquid phase, and a first chemical species. At least a portion of the first chemical species is dissolved in both the first liquid phase and the second liquid phase.

In further embodiments, the methods of the first embodiment can compute solubility of at least one of the chemical species in at least one phase of the mixture.

In further embodiments, the methods of the first embodiment can define the identity that includes identifying each conceptual segment as one of a hydrophobic segment, a hydrophilic segment, or a polar segment.

The methods of this invention can model a wide range of chemical mixtures of Nonelectrolytes and electrolytes. For example, the chemical mixtures can include one or more of the following types of chemical species: an electrolyte, an organic nonelectrolyte, an organic salt, a compound possessing a net charge, a zwitterions, a polar compound, a nonpolar compound, a hydrophilic compound, a hydrophobic compound, a petrochemical, a hydrocarbon, a halogenated hydrocarbon, an ether, a ketone, an ester, an amide, an alcohol, a glycol, an amine, an acid, water, an alkane, a surfactant, a polymer, and an oligomer.

In further embodiments, the mixture includes at least one chemical species which is a solvent (e.g., a solvent used in a pharmaceutical production, screening, or testing process), a solute, a pharmaceutical component, a compound used in an agricultural application (e.g., a herbicide, a pesticide, or a fertilizer) or a precursor of a compound used in an agricultural application, a compound used in an adhesive composition or a precursor of a compound used in an adhesive composition, a compound used in an ink composition or a precursor of a compound used in an ink composition. As used herein, a “pharmaceutical component” includes a pharmaceutical compound, drug, therapeutic agent, or a precursor thereof (i.e., a compound used as an ingredient in a pharmaceutical compound production process). The “pharmaceutical component” of this invention can be produced by any publicly known method or by any method equivalent with the former. The pharmaceutical agent or other active compound of the present invention may comprise a single pharmaceutical or a combination of pharmaceuticals. These active ingredients may be incorporated in the adhesive layer, backing layer or in both. A pharmaceutical component can also include ingredients for enhancing drug solubility and/or stability of the drug to be added to the layer or layers containing the active ingredient. In some embodiments, the mixture includes at least one pharmaceutical component having a molecular weight greater than about 900 daltons, at least one pharmaceutical component having a molecular weight in the range of between about 100 daltons and about 900 daltons, and/or at least one pharmaceutical component having a molecular weight in the range of between about 200 daltons and about 600 daltons. In further embodiments, the mixture includes at least one nonpolymeric pharmaceutical component.

In further embodiments, the mixture includes at least one ICH solvent, which is a solvent listed in the ICH Harmonized Tripartite Guideline, Impurities: Guideline for Residual Solvents Q3C, incorporated herein in its entirety by reference. ICH STEERING COMMITTEE , ICH Harmonized Tripartite Guideline, Impurities: Guideline for Residual Solvents Q3C, International Conference of Harmonization of Technical Requirements for Registration of Pharmaceuticals for Human Use (1997).

It will be apparent to those skilled in the art that a component of the mixture can belong to more than one type of chemical species.

In accordance with one aspect of the present invention, at least one conceptual segment (e.g., at least 1, 2, 3, 4, 5, 7, 10, 12, or more than 12 conceptual segments) is determined or defined for each of the chemical species of the mixture. The conceptual segments are molecular descriptors of the various molecular species in the mixture. An identity and an equivalent number are determined for each of the conceptual segments. Examples of identities for conceptual segments include a hydrophobic segment, a polar segment, a hydrophilic segment, a charged segment, and the like. Experimental phase equilibrium data can be used to determine the equivalent number of the conceptual segment(s).

The determined conceptual segments are used to compute at least one physical property of the mixture, and an analysis of the computed physical property is provided to form a model of at least one physical property of the mixture. The methods of this invention are able to model a wide variety of physical properties. Examples of physical properties include vapor pressure, solubility (e.g., the equilibrium concentration of one or more chemical species in one or more phases of the mixture), boiling point, freezing point, octanol/water partition coefficient, lipophilicity, and other physical properties that are measured or determined for use in the chemical processes.

Preferably, the methods provide equilibrium values of the physical properties modeled. For example, a mixture can include at least one liquid solvent and at least one solid pharmaceutical component and the methods can be used to model the solubility of the pharmaceutical component. In this way, the methods can provide the concentration of the amount (e.g., a concentration value) of the pharmaceutical component that will be dissolved in the solvent at equilibrium. In another example, the methods can model a mixture that includes a solid phase (e.g., a solid pharmaceutical component) and at least two liquid phases (e.g., two solvent that are immiscible in one another). The model can predict, or be used to predict, how much of the pharmaceutical component will be dissolved in the two liquid phases and how much will be left in the solid phase at equilibrium. In yet a further embodiment, the methods can be used to predict the behavior of a mixture after a change has occurred. For example, if the mixture includes two liquid phases and one solid phase, and an additional chemical species is introduced into the mixture (e.g., a solvent, pharmaceutical component, or other chemical compound), additional amounts of a chemical species are introduced into the mixture, and/or one or more environmental conditions are changes (e.g., a change in temperature and/or pressure), the method can be used to predict how the introduction of the chemical species and/or change in conditions will alter one or more physical properties of the mixture at equilibrium.

The models of the physical property or properties of the mixture are produced by determining the interaction characteristics of the conceptual segments. In some embodiments, the segment-segment interaction characteristics of the conceptual segments are represented by their corresponding binary NRTL parameters. (See Example 11.) Given the NRTL parameters for the conceptual segments and the numbers and types of conceptual segments for the molecules, the NRTL-SAC model computes activity coefficients for the segments and then for the various molecules in the mixture. In other words, the physical properties or behavior of the mixture will be accounted for based on the segment compositions of the molecules and their mutual interactions. The activity coefficient of each molecule is computed from the number and type of segments for each molecule and the corresponding segment activity coefficients.

In one embodiment, the invention features methods of conducting industrial manufacture, research or development where the at least two chemical species includes at least one electrolyte. Electrolytes dissociate to ionic species in solutions. For “strong” electrolytes, the dissociation is “completely” to ionic species. For “weak” electrolytes, the dissociation is partially to ionic species while undissociated electrolytes, similar to nonelectrolytes, remain as neutral molecular species. Complexation of ionic species with solvent molecules or other ionic species may also occur. An implication of the electrolyte solution chemistry is that the extended model should provide a thermodynamically consistent framework to compute activity coefficients for both molecular species and ionic species.

Preferably, the method comprises computer implemented steps of: (a) using the determined conceptual electrolyte segment, computing at least one physical property of the mixture; and (b) providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture. The methods of this invention are able to model a wide variety of physical properties involving electrolytes, including activity coefficient, vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, and lipophilicity of the electrolyte.

The computed physical property of the analysis can include at least one of activity coefficient, vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, and lipophilicity of the electrolyte.

In a more preferred embodiment, the step of computing at least one physical property includes calculating the activity coefficient of the ionic species derived from the electrolyte.

In further embodiments, the methods include the electrolyte that is any one of a pharmaceutical compound, a nonpolymeric compound, a polymer, an oligomer, an inorganic compound and an organic compound. In some embodiment, the electrolyte is symmetrical or unsymmetrical. In another embodiment, the electrolyte is univalent or multivalent. In yet another embodiment, the electrolyte includes two or more ionic species.

In some embodiments, the invention features methods of conducting a pharmaceutical activity. In one embodiment, the methods comprise the computer implemented steps of modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. Determining at least one conceptual segment includes defining an identity and an equivalent number of each conceptual segment.

The term “pharmaceutical activity”, as used herein, has the meaning commonly afforded the term in the art. A pharmaceutical activity can include ones for drug discovery, development or manufacture. Particularly, a pharmaceutical activity can include one that is art, practice, or profession of researching, preparing, preserving, compounding, and dispensing medical drugs and that is of, relating to, or engaged in pharmacy or the manufacture and sale of pharmaceuticals. A pharmaceutical activity further includes the branch of health/medical science and the sector of public life concerned with maintaining or restoring human/mammalian health through the study, diagnosis and treatment of disease and injury. It includes both an area of knowledge—i.e. the chemical make-up of a drug—and the applied practice—i.e. drugs in relation to some diseases and methods of treatment. A pharmaceutical activity can also include at least one of drug design, drug synthesis, drug formulation, drug characterization, drug screen and assay, clinical evaluation, and drug purification. In a more preferred embodiment, the drug synthesis can include distillation, screening, crystallization, filtration, washing, or drying.

In particular, the terms “drug design”, as used herein, has the meaning commonly afforded the term in the art. Drug design can include the approach of finding drugs by design, based on their biological targets. Typically, a drug target is a key molecule involved in a particular metabolic or signaling pathway that is specific to a disease condition or pathology, or to the infectivity or survival of a microbial pathogen. The term “drug characterization”, as used herein, also has the meaning commonly afforded the term in the art. The meaning can include a wide range of analyses to obtain identity, purity, and stability data for new drug substances and formulations, including: structural identity and confirmation, certificates of analyses, purity determinations, stability-indicating methods development and validation identification and quantification of impurities, and residual solvent analyses.

In some embodiment, the pharmaceutical activity can include studies on a molecular interaction within the mixture. The term “study”, used herein, can include an endeavor for acquiring knowledge about a given subject through, for example, an experiment, (i.e. clinical trial). In a preferred embodiment, examples of the studies can include one or more of pharmacokinetics, pharmacodynamics, solvent screening, combination drug therapy, drug toxicity, a process design for an active pharmaceutical ingredient, and chromatography. The cited types of study has the meaning commonly afforded the term in the art.

In further embodiments, the methods of conducting a pharmaceutical activity can comprise the mixture that includes at least one liquid phase. In one embodiment, the methods can include any number and combination of vapor, solid and liquid phases. In another embodiment, the methods include at least one liquid phase and at least one solid phase. In a preferred embodiment, the mixture can include at least one liquid solvent and at least one pharmaceutical component. In a more preferred embodiment, the mixture can include more than one phase and at least a portion of the at least one pharmaceutical component. The pharmaceutical component can be an active pharmaceutical ingredient.

The liquid phase can be an amorphous phase. The term, “an amorphous phase”, used herein, has the meaning commonly afforded the term in the art. An amorphous phase can include a solid in which there is no long-range order of the positions of the atoms. (Solids in which there is long-range atomic order are called crystalline solids.) Most classes of solid materials can be found or prepared in an amorphous form. For instance, common window glass is an amorphous ceramic, many polymers (such as polystyrene) are amorphous, and even foods such as cotton candy are amorphous phase. Amorphous materials are often prepared by rapidly cooling molten material. The cooling reduces the mobility of the material's molecules before they can pack into a more thermodynamically favorable crystalline state. Amorphous materials can also be produced by additives which interfere with the ability of the primary constituent to crystallize. For example addition of soda to silicon dioxide results in window glass and the addition of glycols to water results in a vitrified solid. In a preferred embodiment, at least one of the species in the mixture that is in the amorphous phase is an active pharmaceutical ingredient. In a more preferred embodiment, the method can include a step of estimating an amorphous phase solubility by calculating a phase equilibrium between a solute rich phase and a solvent rich phase.

In some embodiments, the methods of conducting a pharmaceutical activity can include a mixture that has at least one of the at least two chemical species is a pharmaceutical component. In a preferred embodiment, the pharmaceutical component is an active pharmaceutical ingredient.

In some embodiments, the methods of conducting a pharmaceutical activity can further comprise the steps of: (a) using the determined conceptual segments, computing at least one physical property of the mixture; and (b) providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture. In a preferred embodiment, the step of defining an identity can include steps of identifying each conceptual segment as one of a hydrophobic segment, a hydrophilic segment, or a polar segment.

In some embodiments, the methods of conducting a pharmaceutical activity can comprise a mixture of at least two chemical species that includes at least one electrolyte. In further embodiments, the methods further include the steps of: a) using the determined conceptual electrolyte segment, computing at least one physical property of the mixture; and b) providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture. In one embodiment, the step of computing at least one physical property can include steps of calculating the activity coefficient of the ionic species derived from the electrolyte. In a preferred embodiment, the computed physical property of the analysis can include at least one of activity coefficient, vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, and lipophilicity of the electrolyte.

In further embodiment, the conceptual electrolyte segment can include a cationic segment and an anionic segment, both segments of unity of charge.

In some embodiment, the electrolyte is any one of: a pharmaceutical compound, a nonpolymeric compound, a polymer, an oligomer, an inorganic compound and an organic compound. In one embodiment, the electrolyte is symmetrical or unsymmetrical. In another embodiment, the electrolyte is univalent or multivalent. In yet another embodiment, the electrolyte includes two or more ionic species.

In some embodiments, the invention features methods of separating one or more chemical species from a mixture. The methods include steps of modeling molecular interaction between the chemical species in one or more solvents by determining at least one conceptual segment for each of the species, including defining an identity and an equivalent number of each conceptual segment.

In further embodiments, the methods of separating one or more chemical species from a mixture can use chromatography. In a preferred embodiment, the types of chromatography can include one of the following: capillary-action chromatography, paper chromatography, thin layer chromatography, column chromatography, fast protein liquid chromatography, high performance liquid chromatography, ion exchange chromatography, affinity chromatography, gas chromatography, and countercurrent chromatography. In a more preferred embodiment, the chromatography is high performance liquid chromatography.

In one embodiment, the methods of separating one or more chemical species can comprise a mixture that includes at least one liquid phase. In another embodiment, the method of separating one or more chemical species can comprise a mixture that includes at least one liquid phase and that at least a portion of at least one chemical species is in the liquid phase. In yet another embodiment, at least one of the chemical species of the method of separating one or more chemical species is an active pharmaceutical ingredient.

In some embodiments, the methods of separating one or more chemical species can include the steps of: a) using the determined conceptual segments, computing at least one physical property of the mixture; and b) providing an analysis of the computed physical property. The analysis forms a model of the at least one physical property of the mixture. In further embodiments, the steps of defining an identity can include steps of identifying each conceptual segment as one of a hydrophobic segment, a hydrophilic segment, or a polar segment.

In some embodiments, this invention features computer program products. The computer program products comprise a computer usable medium and a set of computer program instructions embodied on the computer useable medium for conducting industrial manufacture, research or development by modeling at least one physical property of a mixture of at least two chemical species by determining at least one conceptual segment for each of the chemical species. Included are instructions to define an identity and an equivalent number of each of conceptual segment. In a preferred embodiment, the computer usable medium can include a removable storage medium. In a more preferred embodiment, the removable storage medium can include any of a CD-ROM, a DVD-ROM, a diskette, and a tape.

In further embodiment, the computer program products can include: (a) instructions to use the determined conceptual segments to compute at least one physical property of the chemical mixture; and (b) instructions to provide an analysis of the computed physical property. The analysis forms a model of at least one physical property of the mixture.

In one embodiment of the computer program products, at least some portion of the computer program instructions can include instructions to request data or request instructions over a telecommunications network. In another embodiment, at least some portion of the computer program is transmitted over a global network.

In another embodiment of the computer program products, an industrial manufacture, research or development can include a pharmaceutical activity. In a preferred embodiment, the pharmaceutical activity can include one or more of the following: pharmacokinetics, pharmacodynamics, solvent screening, crystallization productivity, drug formulation, combination drug therapy, drug toxicity, a process design for an active pharmaceutical ingredient, capillary-action chromatography, paper chromatography, thin layer chromatography, column chromatography, fast protein liquid chromatography, high performance liquid chromatography, ion exchange chromatography, affinity chromatography, gas chromatography, and countercurrent chromatography.

In some embodiment, the invention features computer systems for conducting industrial manufacture, research or development by modeling at least one physical property of a mixture of at least two chemical species. The computer systems can include: a) a user input means for obtaining chemical data from a user; b) a digital processor coupled to receive obtained chemical data input from the input means; and c) an output means coupled to the digital processor. The digital processor executes a modeling system in working memory, and the output means provides to the user the formed model of the physical property of the mixture. The modeling system may use the chemical data to determine at least one conceptual segment for each of the chemical species, including defining an identity and equivalent number of each conceptual segment.

In further embodiments, the computer system can: a) use the determined conceptual electrolyte segment to compute at least one physical property of the mixture; and b) provide an analysis of the computed physical property. The analysis forms a model of at least one physical property of the mixture. In a preferred embodiment, the computer system can enable transmission of some portion of at least one of the chemical data and the formed model over a global network. Alternatively, the computer system can also conduct industrial manufacture, research or development that includes a pharmaceutical activity. In a preferred embodiment, conducting industrial manufacture, research or development can include one or more of the following: pharmacokinetics, pharmacodynamics, solvent screening, crystallization productivity, drug formulation, combination drug therapy, drug toxicity, a process design for an active pharmaceutical ingredient. capillary-action chromatography, paper chromatography, thin layer chromatography, column chromatography, fast protein liquid chromatography, high performance liquid chromatography, ion exchange chromatography, affinity chromatography, gas chromatography, and countercurrent chromatography.

Reference is now made to a preferred embodiment of the present invention as illustrated in FIGS. 1-4. FIG. 1 illustrates a computer network or similar digital processing environment in which the present invention may be implemented.

Referring to FIG. 1, client computer(s)/devices 50 and server computer(s) 60 provide processing, storage, and input/output devices executing application programs and the like. Client computer(s)/devices 50 can also be linked through communications network 70 to other computing devices, including other client devices/processes 50 and server computer(s) 60. Communications network 70 can be part of a remote access network, a global network (e.g., the Internet), a worldwide collection of computers, Local area or Wide area networks, and gateways that currently use respective protocols (TCP/IP, Bluetooth, etc.) to communicate with one another. Other electronic device/computer network architectures are suitable.

FIG. 2 is a diagram of the internal structure of a computer (e.g., client processor/device 50 or server computers 60) in the computer system of FIG. 1. Each computer 50, 60 contains system bus 79, where a bus is a set of hardware lines used for data transfer among the components of a computer or processing system. Bus 79 is essentially a shared conduit that connects different elements of a computer system (e.g., processor, disk storage, memory, input/output ports, network ports, etc.) that enables the transfer of information between the elements. Attached to system bus 79 is I/O device interface 82 for connecting various input and output devices (e.g., keyboard, mouse, displays, printers, speakers, etc.) to the computer 50, 60. Network interface 86 allows the computer to connect to various other devices attached to a network (e.g., network 70 of FIG. 14). Memory 90 provides volatile storage for computer software instructions 92 and data 94 used to implement an embodiment of the present invention (e.g., NRTL-SAC and eNRTL-SAC in FIGS. 3-4). Disk storage 95 provides non-volatile storage for computer software instructions 92 and data 94 used to implement an embodiment of the present invention. Central processor unit 84 is also attached to system bus 79 and provides for the execution of computer instructions.

In one embodiment, the processor routines 92 and data 94 are a computer program product (generally referenced 92 or 20), including a computer readable medium (e.g., a removable storage medium such as one or more DVD-ROM's, CD-ROM's, diskettes, tapes, etc.) that provides at least a portion of the software instructions for the invention system 20. Computer program product 92 can be installed by any suitable software installation procedure, as is well known in the art. In another embodiment, at least a portion of the software instructions may also be downloaded over a cable, communication and/or wireless connection. In other embodiments, the invention programs are a computer program propagated signal product 107 embodied on a propagated signal on a propagation medium (e.g., a radio wave, an infrared wave, a laser wave, a sound wave, or an electrical wave propagated over a global network such as the Internet, or other network(s)). Such carrier medium or signals provide at least a portion of the software instructions for the present invention routines/program 92.

In alternate embodiments, the propagated signal is an analog carrier wave or digital signal carried on the propagated medium. For example, the propagated signal may be a digitized signal propagated over a global network (e.g., the Internet), a telecommunications network, or other network. In one embodiment, the propagated signal is a signal that is transmitted over the propagation medium over a period of time, such as the instructions for a software application sent in packets over a network over a period of milliseconds, seconds, minutes, or longer. In another embodiment, the computer readable medium of computer program product 92 (e.g., NRTL-SAC or eNRTL-SAC) is a propagation medium that the computer system 50 may receive and read, such as by receiving the propagation medium and identifying a propagated signal embodied in the propagation medium, as described above for computer program propagated signal product.

Generally speaking, the term “carrier medium” or transient carrier encompasses the foregoing transient signals, propagated signals, propagated medium, storage medium and the like.

FIGS. 3 and 4 illustrate data flow and process steps for a model 20 (The model 20 refers to the present invention in context of both NRTL-SAC and eNRTL-SAC models.) performing the methods of the present invention. With reference to FIG. 3, chemical data describing one or more chemical species (e.g., an electrolyte and solvent) of the mixture and/or environmental conditions (e.g., pressure and/or temperature) is entered at step 105 of the modeler process. Step 110 uses that data to determine at least one conceptual segment including a conceptual segment (for nonelectrolytes) or a conceptual electrolyte segment (for electrolytes) for each of the chemical species of the mixture. The determined conceptual segment or electrolyte conceptual segment and other determined conceptual segments are used to compute at least one physical property of the mixture during step 115. The computed physical properties are analyzed to form a model of at least one physical property of the mixture (e.g., solubility of one or more chemical species in one or more phases of the mixture) in step 120. The model information is then given as output at step 125. The output can take the form of data or an analysis appearing on a computer monitor, data or instructions sent to a process control system or device, data entered into a data storage device, and/or data or instructions relayed to additional computer systems or programs.

FIGS. 4 a and 4 b illustrate in more detail the computation at step 115 in FIG. 3. Step 115 begins with the receipt of determined conceptual electrolyte and other segments for each of the chemical species (e.g., nonelectrolytes, electrolyte, solvent, etc.) of the mixture. The determined conceptual segments and the equation:

$\begin{matrix} {{\ln\;\gamma_{m}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime}}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}} \right)}}}} & (1) \end{matrix}$ are used to compute at least one physical property of the mixture during step 215 in FIG. 4 a. As for the determined conceptual electrolyte and other segments shown in FIG. 4 b, the equation:

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{*}} = {{\ln\;\gamma_{I}^{*{lc}}} + {\ln\;\gamma_{I}^{*{PDH}}} + {\ln\;\gamma_{I}^{*{FH}}} + {\Delta\;\ln\;\gamma_{I}^{Born}}}} \\ {= {{\sum\limits_{m}{r_{m,I}\left( {{\ln\;\Gamma_{m}^{*{lc}}} + {\ln\;\Gamma_{m}^{*{PDH}}}} \right)}} +}} \\ {{r_{c,I}\left( {{\ln\;\Gamma_{c}^{*{lc}}} + {\ln\;\Gamma_{c}^{*{PDH}}} + {{\Delta ln}\;\Gamma_{c}^{Born}}} \right)} +} \\ {{r_{a,I}\left( {{\ln\;\Gamma_{a}^{*{lc}}} + {\ln\;\Gamma_{a}^{*{PDH}}} + {\Delta\;\ln\;\Gamma_{a}^{Born}}} \right)} + {\ln\;\gamma_{I}^{*{FH}}}} \end{matrix} & (2) \end{matrix}$ are used to compute at least one physical property of the mixture during step 215. The computed physical properties are provided as output 220 from computation step 215. In step 220, the computed physical properties are passed to step 120 of FIG. 16 for forming a model of the physical property of the mixture as described above.

The following Examples are illustrative of the invention, and are not meant to be limiting in any way.

EXAMPLE 1 Modeling a Mixture of Nonelectrolyte Chemical Species

A study was performed to determine how well the NRTL-SAC models the solubility of mixtures comprising a solid organic nonelectrolyte.

The solubility of a solid organic nonelectrolyte is described well by the expression:

$\begin{matrix} {{\ln\; x_{I}^{SAT}} = {{\frac{\Delta_{fus}S}{R}\left( {1 - \frac{T_{m}}{T}} \right)} - {\ln\;\gamma_{I}^{SAT}}}} & (3) \end{matrix}$ for T≦T_(m) and where the entropy of fusion of the solid (Δ_(fus)S) is represented by:

$\begin{matrix} {{\Delta_{fus}S} = \frac{\Delta_{fus}H}{T_{m}}} & (4) \end{matrix}$ x_(I) ^(SAT) is the mole fraction of the solid (the solute) dissolved in the solvent phase at saturation, γ_(I) ^(SAT) is the activity coefficient for the solute in the solution at saturation, R is the gas constant, T is the temperature, and T_(m) is the melting point of the solid. Given a polymorph, Δ_(fus)S and T_(m) are fixed and the solubility is then a function of temperature and activity coefficient of the solute in the solution. The activity coefficient of the solute in the solution plays the key role in determining the solubility. In general, the activity coefficient of the solute in the solution is usually calculated from a liquid activity coefficient model.

Except for the ideal solution model, an activity coefficient model is often written in two parts as such: ln γ_(I)=ln γ_(I) ^(C)+ln γ_(I) ^(R)  (5) γ_(I) ^(C) and γ_(I) ^(R) are the combinatorial and residual contributions to the activity coefficient of component I, respectively.

In NRTL-SAC, the combinatorial part, γ_(I) ^(C), is calculated from the Flory-Huggins term for the entropy of mixing. The residual part, γ_(I) ^(R), is set equal to the local composition (lc) interaction contribution, γ_(I) ^(lc):

$\begin{matrix} {\mspace{79mu}{{{\ln\;\gamma_{I}^{R}} = {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{m}{r_{m,I}\left\lfloor {{\ln\;\gamma_{m}^{lc}} - {\ln\;\gamma_{m}^{{lc},I}}} \right\rfloor}}}}\mspace{20mu}{with}}} & (6) \\ {{{\ln\;\gamma_{m}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime}}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}} \right)}}}},{{\ln\;\gamma_{m}^{{lc},I}} = {\frac{\sum\limits_{j}{x_{j,I}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k,I}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime},I}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}} \right)}}}},\mspace{79mu}{x_{j} = \frac{\sum\limits_{J}{x_{J}r_{j,J}}}{\sum\limits_{I}{\sum\limits_{i}{x_{I}r_{i,I}}}}},\mspace{20mu}{x_{j,I} = \frac{r_{j,I}}{\sum\limits_{j}r_{j,I}}},} & (7) \end{matrix}$ where i, j, k, m, m′ are the segment-based species index, I,J are the component index, x_(j) is the segment-based mole fraction of segment species j, and x_(J) is the mole fraction of component J, r_(m,I) is the number of segment species m contained in component I, γ_(m) ^(lc) is the activity coefficient of segment species m, and γ_(m) ^(lc,I) is the activity coefficient of segment species m contained only in component I. G and τ are local binary quantities related to each other by the NRTL non-random factor parameter α: G=exp(−ατ)  (8)

The equation:

$\begin{matrix} {{\ln\;\gamma_{I}^{R}} = {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{m}{r_{m,I}\left\lfloor {{\ln\;\gamma_{m}^{lc}} - {\ln\;\gamma_{m}^{{lc},I}}} \right\rfloor}}}} & (9) \end{matrix}$ is a general form for the local composition interaction contribution to activity coefficients of components in the NRTL-SAC model of the present invention. For mono-segment solvent components (S), this equation can be simplified and reduced to the classical NRTL model as follows:

$\begin{matrix} {{\ln\;\gamma_{I = S}^{lc}} = {\sum\limits_{m}{r_{m,S}\left\lfloor {{\ln\;\gamma_{m}^{lc}} - {\ln\;\gamma_{m}^{{lc},S}}} \right\rfloor}}} & (10) \end{matrix}$ with r _(m,S)=1, ln γ_(m) ^(lc,S)=0.  (11)

Therefore,

$\begin{matrix} {{{\ln\;\gamma_{I = S}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jS}\tau_{jS}}}{\sum\limits_{k}{x_{k}G_{kS}}} + {\sum\limits_{m}{\frac{x_{m}G_{Sm}}{\sum\limits_{k}{x_{k}G_{km}}}\left( {\tau_{Sm} - \frac{\sum\limits_{k}{x_{k}G_{km}\tau_{km}}}{\sum\limits_{k}{x_{k}G_{km}}}} \right)}}}},} & (12) \end{matrix}$ where G _(jS)=exp(−α_(jS)τ_(jS)), G _(Sj)=exp(−α_(jS)τ_(Sj)).  (13) This is the same equation as the classical NRTL model.

Three conceptual segments were defined for nonelectrolyte molecules: a hydrophobic segment, a polar segment, and a hydrophilic segment. These conceptual segments qualitatively capture the phase behavior of real molecules and their corresponding segments. Real molecules in turn are used as reference molecules for the conceptual segments and available phase equilibrium data of these reference molecules are used to identify NRTL binary parameters for the conceptual segments. Preferably, these reference molecules possess distinct molecular characteristics (i.e., hydrophobic, hydrophilic, or polar) and have abundant, publicly available, thermodynamic data (e.g., phase equilibrium data).

The study was focused on the 59 ICH solvents used in pharmaceutical process design. Water, triethylamine, and n-octanol were also considered. Table 1 shows these 62 solvents and the solvent characteristics.

TABLE 1 Common Solvents in Pharmaceutical Process Design Solvent Solvent (Component 1) τ₁₂ ^(a) τ₂₁ ^(a) τ₁₂ ^(b) τ₂₁ ^(b) τ₁₂ ^(c) τ₂₁ ^(c) characteristics ACETIC-ACID 1.365 0.797 2.445 −1.108 Complex ACETONE 0.880 0.935 0.806 1.244 Polar ACETONITRILE 1.834 1.643 0.707 1.787 Polar ANISOLE Hydrophobic BENZENE 1.490 −0.614 3.692 5.977 Hydrophobic 1-BUTANOL −0.113 2.639 0.269 2.870 −2.157 5.843 Hydrophobic/ Hydrophilic 2-BUTANOL −0.165 2.149 −0.168 3.021 −1.539 5.083 Hydrophobic/ Hydrophilic N-BUTYL-ACETATE 1.430 2.131 Hydrophobic/Polar METHYL-TERT-BUTYL- −0.148 0.368 1.534 4.263 Hydrophobic ETHER CARBON-TETRACHLORIDE 1.309 −0.850 5.314 7.369 Hydrophobic CHLOROBENZENE 0.884 −0.194 4.013 7.026 Hydrophobic CHLOROFORM 1.121 −0.424 3.587 4.954 Hydrophobic CUMENE Hydrophobic CYCLOHEXANE −0.824 1.054 6.012 9.519 Hydrophobic 1,2-DICHLOROETHANE 1.576 −0.138 3.207 4.284 2.833 4.783 Hydrophobic 1,1-DICHLOROETHYLENE Hydrophobic 1,2-DICHLOROETHYLENE Hydrophobic DICHLOROMETHANE 0.589 0.325 1.983 3.828 Polar 1,2-DIMETHOXYETHANE 0.450 1.952 Polar N,N-DIMETHYLACETAMIDE −0.564 1.109 Polar N,N-DIMETHYLFORMAMIDE 1.245 1.636 −1.167 2.044 Polar DIMETHYL-SULFOXIDE −2.139 0.955 Polar 1,4-DIOXANE 1.246 0.097 1.003 1.010 Polar ETHANOL 0.533 2.192 −0.024 1.597 Hydrophobic/ Hydrophilic 2-ETHOXYETHANOL −0.319 2.560 −1.593 1.853 Hydrophobic/ Hydrophilic ETHYL-ACETATE 0.771 0.190 0.508 3.828 Hydrophobic/Polar ETHYLENE-GLYCOL 1.380 −1.660 Hydrophilic DIETHYL-ETHER −0.940 1.400 1.612 3.103 Hydrophobic ETHYL-FORMATE Polar FORMAMIDE Complex FORMIC-ACID −0.340 −1.202 Complex N-HEPTANE −0.414 0.398 Hydrophobic N-HEXANE 6.547 10.949 6.547 10.949 Hydrophobic ISOBUTYL-ACETATE Polar ISOPROPYL-ACETATE Polar METHANOL 1.478 1.155 0.103 0.396 Hydrophobic/ Hydrophilic 2-METHOXYETHANOL 1.389 −0.566 Hydrophobic/ Hydrophilic METHYL-ACETATE 0.715 2.751 Polar 3-METHYL-1-BUTANOL 0.062 2.374 −0.042 3.029 −0.598 5.680 Hydrophobic/ Hydrophilic METHYL-BUTYL-KETONE Hydrophobic/Polar METHYLCYCLOHEXANE 1.412 −1.054 Polar METHYL-ETHYL-KETONE −0.036 1.273 0.823 2.128 −0.769 3.883 Hydrophobic/Polar METHYL-ISOBUTYL-KETONE 0.977 4.868 Hydrophobic/Polar ISOBUTANOL 0.021 2.027 0.592 2.702 −1.479 5.269 Hydrophobic/ Hydrophilic N-METHYL-2-PYRROLIDONE −0.583 3.270 −0.235 0.437 Hydrophobic NITROMETHANE 1.968 2.556 Polar N-PENTANE 0.496 −0.523 Hydrophobic 1-PENTANOL −0.320 2.567 −0.029 3.583 Hydrophobic/ Hydrophilic 1-PROPANOL 0.049 2.558 0.197 2.541 Hydrophobic/ Hydrophilic ISOPROPYL-ALCOHOL 0.657 1.099 0.079 2.032 Hydrophobic/ Hydrophilic N-PROPYL-ACETATE 1.409 2.571 Hydrophobic/Polar PYRIDINE −0.665 1.664 −0.990 3.146 Polar SULFOLANE 1.045 0.396 Polar TETRAHYDROFURAN 0.631 1.981 1.773 0.563 Polar 1,2,3,4- 1.134 −0.631 Hydrophobic TETRAHYDRONAPHTHALENE TOLUENE −0.869 1.292 4.241 7.224 Hydrophobic 1,1,1-TRICHLOROETHANE 0.535 −0.197 Hydrophobic TRICHLOROETHYLENE 1.026 −0.560 Hydrophobic M-XYLENE Hydrophobic WATER 10.949 6.547 Hydrophilic TRIETHYLAMINE −0.908 1.285 1.200 1.763 −0.169 4.997 Hydrophobic/Polar 1-OCTANOL −0.888 3.153 0.301 8.939 Hydrophobic/ Hydrophilic Wherein: 1. τ₁₂ ^(a) and τ₂₁ ^(a) are NRTL binary τ parameters for systems of the listed solvents and hexane. NRTL non-random factor parameter, α, is fixed as a constant of 0.2. In these binary systems, solvent is component 1 and hexane component 2. τ's were determined from available VLE & LLE data. 2. τ₁₂ ^(b) and τ₂₁ ^(b) are NRTL binary τ parameters for systems of the listed solvents and water. NRTL non-random factor parameter, α, is fixed as a constant of 0.3. In these binary systems, solvent is component 1 and water component 2. τ's were determined from available VLE data. 3. τ₁₂ ^(c) and τ₂₁ ^(c) are NRTL binary τ parameters for systems of the listed solvents and water. NRTL non-random factor parameter, α, is fixed as a constant of 0.2. In these binary systems, solvent is component 1 and water component 2. τ's were determined from available LLE data.

Hydrocarbon solvents (aliphatic or aromatic), halogenated hydrocarbons, and ethers are mainly hydrophobic. Ketones, esters and amides are both hydrophobic and polar. Alcohols, glycols, and amines may have both substantial hydrophilicity and hydrophobicity. Acids are complex, with hydrophilicity, polarity, and hydrophobicity.

Also shown in Table 1 are the available NRTL binary parameters (τ) for various solvent-water binary systems and solvent-hexane binary systems. Applicants obtained these binary parameters from fitting selected literature phase equilibrium data and deliberately ignoring the temperature dependency of these parameters. These values illustrate the range of values for these binary parameters. Note that many of the binary parameters are missing, as the phase equilibrium data is not found in the literature or simply has never been determined for that solvent mixture. Also note the sheer number of binary parameters needed for the prior art NRTL models for even a moderately sized system of solvents. For example, to model 60 solvents with the NRTL model, 60×60 NRTL binary parameters would be needed.

Table 1 shows that, for the NRTL binary parameters determined from VLE and LLE data for hydrophobic solvent (1)/water (2) binaries, all hydrophobic solvents exhibit similar repulsive interactions with water and both τ₁₂ and τ₂₁ are large positive values for the solvent-water binaries. When the hydrophobic solvents also carry significant hydrophilic or polar characteristics, τ₁₂ becomes negative while τ₂₁ retain a large positive value.

Table 1 also illustrates that similar repulsive, but weaker, interactions between a polar solvent (1) and hexane (2), a representative hydrophobic solvent. Both τ₁₂ and τ₂₁ are small, positive values for the solvent-hexane binaries. The interactions between hydrophobic solvents and hexane are weak and the corresponding NRTL binary parameters are around or less than unity, characteristic of nearly ideal solutions.

The interactions between polar solvents (1) and water (2) are more subtle. While all τ₂₁ are positive, τ₁₂ can be positive or negative. This is probably due to different polar molecules exhibiting different interactions, some repulsive and others attractive, with hydrophilic molecules.

Hexane and water were chosen as the reference molecule for hydrophobic segment and for hydrophilic segment, respectively. The selection of reference molecule for polar segment requires attention to the wide variations of interactions between polar molecules and water. Acetonitrile was chosen as the reference molecule for a polar segment, and a mechanism was introduced to tune the way the polar segment is characterized. The tuning mechanism, as shown in Table 2, allows tuning of the interaction characteristics between the polar segment and the hydrophilic segment. In other words, instead of using only one polar segment (“Y”), two polar segments (“Y−” and “Y+”) were used. The difference between Y− and Y+ is the way they interact with the hydrophilic segment.

The chosen values for the NRTL binary interactions parameters, α and τ, for the three conceptual segments are summarized in Table 2.

TABLE 2 NRTL Binary Parameters for Conceptual Segments in NRTL-SAC Segment X (hydrophobic X (hydrophobic Y−(polar Y+(polar X (hydrophobic (1) segment) segment) segment) segment) segment) Segment Y−(polar Z (hydrophilic Z (hydrophilic Z (hydrophilic Y+(polar (2) segment) segment) segment) segment) segment) τ₁₂ 1.643  6.547 −2.000 2.000 1.643 τ₂₁ 1.834 10.949 1.787 1.787 1.834 α₁₂ = α₂₁ 0.2  0.2 0.3 0.3 0.2

As a first approximation, the temperature dependency of the binary parameters was ignored.

The binary parameters for the hydrophobic segment (1)—hydrophilic segment (2) were determined from available liquid-liquid equilibrium data of hexane-water binary mixture (see Table 1). α was fixed at 0.2 because it is the customary value for α for systems that exhibit liquid-liquid separation. Here both τ₁₂ and τ₂₁ are large positive values (6.547, 10.950). They highlight the strong repulsive nature of the interactions between the hydrophobic segment and the hydrophilic segment.

Determining a suitable value for α is known in the art. See J. M. PRAUSNITZ, ET AL., MOLECULAR THERMODYNAMICS OF FLUID-PHASE EQUILIBRIA 261 (3d ed. 1999).

The binary parameters for the hydrophobic segment (1)-polar segment (2) were determined from available liquid-liquid equilibrium data of hexane-acetonitrile binary mixture (see Table 1). Again, α was fixed at 0.2. Both τ₁₂ and τ₂₁ were small positive values (1.643, 1.834). They highlight the weak repulsive nature of the interactions between hydrophobic segment and polar segment.

The binary parameters for the hydrophilic segment (1)-polar segment (2) were determined from available vapor-liquid equilibrium data of water-acetonitrile binary mixture (see Table 1). α was fixed at 0.3 for the hydrophilic segment-polar segment pair because this binary does not exhibit liquid-liquid separation. τ₁₂ was fixed at a positive value (1.787) and τ₂₁ was allowed to vary between −2 and 2. Two types of polar segments were allowed, Y− and Y+. For Y− polar segment, the values of τ₁₂ and τ₂₁ were (1.787, −2). For Y+ polar segment, they were (1.787, 2). Note that both Y− polar segment and Y+ polar segment exhibited the same repulsive interactions with hydrophobic segments as discussed in the previous paragraph. Also, ideal solution was assumed for Y− polar segment and Y+ polar segment mixtures (i.e., τ₁₂=τ₂₁=0).

Table 2 captures the general trends for the NRTL binary parameters that were observed for a wide variety of hydrophobic, polar, and hydrophilic molecules.

The application of the NRTL-SAC model requires a databank of molecular parameters for common solvents used in the industry. In this example, each solvent was described by using up to four molecular parameters, i.e., X, Y+, Y−, and Z. So, using four molecular parameters to model a system of 60 solvents, a set of up to 4×60 molecular parameters would be used. However, due to the fact that these molecular parameters represent certain unique molecular characteristics, often only one or two molecular parameters are needed for most solvents. For example, alkanes are hydrophobic and they are well represented with hydrophobicity, X, alone. Alcohols are hybrids of hydrophobic segments and hydrophilic segments and they are well represented with X and Z. Ketones, esters, and ethers are polar molecules with varying degrees of hydrophobic contents. They are well represented by X and Y's. Hence, the needed set of molecular parameters can be much smaller than 4×60.

Determination of solvent molecular parameters involves regression of experimental VLE or LLE data for binary systems of interested solvent and the above-mentioned reference molecules (i.e., hexane, acetonitrile, and water) or their substitutes. Solvent molecular parameters are the adjustable parameters in the regression. If binary data is lacking for the solvent with the reference molecules, data for other binaries may be used as long as the molecular parameters for the substitute reference molecules are already identified. In a way, these reference molecules can be thought of as molecular probes that are used to elucidate the interaction characteristics of the solvent molecules. These molecular probes express the interactions in terms of binary phase equilibrium data.

Table 3 lists the molecular parameters identified for the common solvents in the ICH list.

TABLE 3 Molecular Parameters for Common Solvents. Solvent name X Y− Y+ Z ACETIC-ACID 0.045 0.164 0.157 0.217 ACETONE 0.131 0.109 0.513 ACETONITRILE 0.018 0.131 0.883 ANISOLE 0.722 BENZENE 0.607 0.190 1-BUTANOL 0.414 0.007 0.485 2-BUTANOL 0.335 0.082 0.355 N-BUTYL-ACETATE 0.317 0.030 0.330 METHYL-TERT-BUTYL-ETHER 1.040 0.219 0.172 CARBON-TETRACHLORIDE 0.718 0.141 CHLOROBENZENE 0.710 0.424 CHLOROFORM 0.278 0.039 CUMENE 1.208 0.541 CYCLOHEXANE 0.892 1,2-DICHLOROETHANE 0.394 0.691 1,1-DICHLOROETHYLENE 0.529 0.208 1,2-DICHLOROETHYLENE 0.188 0.832 DICHLOROMETHANE 0.321 1.262 1,2-DIMETHOXYETHANE 0.081 0.194 0.858 N,N-DIMETHYLACETAMIDE 0.067 0.030 0.157 N,N-DIMETHYLFORMAMIDE 0.073 0.564 0.372 DIMETHYL-SULFOXIDE 0.532 2.890 1,4-DIOXANE 0.154 0.086 0.401 ETHANOL 0.256 0.081 0.507 2-ETHOXYETHANOL 0.071 0.318 0.237 ETHYL-ACETATE 0.322 0.049 0.421 ETHYLENE-GLYCOL 0.141 0.338 DIETHYL-ETHER 0.448 0.041 0.165 ETHYL-FORMATE 0.257 0.280 FORMAMIDE 0.089 0.341 0.252 FORMIC-ACID 0.707 2.470 N-HEPTANE 1.340 N-HEXANE 1.000 ISOBUTYL-ACETATE 1.660 0.108 ISOPROPYL-ACETATE 0.552 0.154 0.498 METHANOL 0.088 0.149 0.027 0.562 2-METHOXYETHANOL 0.052 0.043 0.251 0.560 METHYL-ACETATE 0.236 0.337 3-METHYL-1-BUTANOL 0.419 0.538 0.314 METHYL-BUTYL-KETONE 0.673 0.224 0.469 METHYLCYCLOHEXANE 1.162 0.251 METHYL-ETHYL-KETONE 0.247 0.036 0.480 METHYL-ISOBUTYL-KETONE 0.673 0.224 0.469 ISOBUTANOL 0.566 0.067 0.485 N-METHYL-2-PYRROLIDONE 0.197 0.322 0.305 NITROMETHANE 0.025 1.216 N-PENTANE 0.898 1-PENTANOL 0.474 0.223 0.426 0.248 1-PROPANOL 0.375 0.030 0.511 ISOPROPYL-ALCOHOL 0.351 0.070 0.003 0.353 N-PROPYL-ACETATE 0.514 0.134 0.587 PYRIDINE 0.205 0.135 0.174 SULFOLANE 0.210 0.457 TETRAHYDROFURAN 0.235 0.040 0.320 1,2,3,4-TETRAHYDRO- 0.443 0.555 NAPHTHALENE TOLUENE 0.604 0.304 1,1,1-TRICHLOROETHANE 0.548 0.287 TRICHLOROETHYLENE 0.426 0.285 M-XYLENE 0.758 0.021 0.316 WATER 1.000 TRIETHYLAMINE 0.557 0.105 1-OCTANOL 0.766 0.032 0.624 0.335

Among the ICH solvents, the molecular parameters identified for anisole, cumene, 1,2-dichloroethylene, 1,2-dimethoxyethane, N,N-dimethylacetamide, dimethyl sulfoxide, ethyl formate, isobutyl acetate, isopropyl acetate, methyl-butyl-ketone, tetralin, and trichloroethylene were questionable, due to lack of sufficient experimental binary phase equilibrium data. In fact, no public data for methyl-butyl-ketone (2-hexanone) was found and its molecular parameters were set to be the same as those for methyl-isobutyl-ketone.

The NRTL-SAC model with the molecular parameters qualitatively captures the interaction characteristics of the solvent mixtures and the resulting phase equilibrium behavior. FIGS. 5 to 7 contain three graphs illustrating the binary phase diagrams for a water, 1,4-dioxane, and octanol system at atmospheric pressure. The graphs illustrate the predictions of both the NRTL model with the binary parameters in Table 1 and NRTL-SAC models with the model parameters of Table 3. FIG. 5 illustrates the water, 1,4-dioxane mixture; FIG. 6 illustrates the water, octanol mixture; and FIG. 7 illustrates the octanol, 1,4-dioxane mixture. The predictions with the NRTL-SAC model are broadly consistent with the calculations from the NRTL model that are generally understood to represent experimental data within engineering accuracy.

EXAMPLE 2 Model Prediction Results

Data compiled by Marrero and Abildsko provides a good source of solubility data for large, complex chemicals. Marrero, J. & Abildskov, J., Solubility and Related Properties of Large Complex Chemicals, Part 1: Organic Solutes Ranging from C ₄ to C ₄₀, CHEMISTRY DATA SERIES XV, DECHEMA, (2003). From that applicants extracted solubility data for the 8 molecules reported by Lin and Nash. Lin, H.-M. & R. A. Nash, An Experimental Method for Determining the Hildebrand Solubility Parameter of Organic Electrolytes, 82 J. PHARMACEUTICAL SCI. 1018 (1993). Also tested, were 6 additional molecules with sizable solubility data sets.

The NRTL-SAC model was applied to the solvents that are included in Table 3. The molecular parameters determined for the solutes are summarized in Table 4. During the data regression, all experimental solubility data, regardless of the order of magnitude, were assigned with a standard deviation of 20%. The comparisons between the experimental solubility and the calculated solubility are given in FIGS. 8 to 21, which illustrate phase diagrams for the systems at 298.15K and atmospheric pressure.

Good representations for the solubility data was obtained with the NRTL-SAC model. The RMS errors in ln x for the fits are given in Table 4.

TABLE 4 Molecular parameters for solutes. RMS # of error on Solute MW solvents T (K) X Y− Y+ Z lnK_(sp) ln x p-Aminobenzoic 137.14  7  298.15 0.218 0.681 1.935 0.760 −2.861 0.284 acid Benzoic acid 122.12  7  298.15 0.524 0.089 0.450 0.405 −1.540 0.160 Camphor 152.23  7  298.15 0.604 0.124 0.478 0.000 −0.593 0.092 Ephedrine 165.23  7  298.15 0.458 0.068 0.000 0.193 −0.296 0.067 Lidocaine 234.33  7  298.15 0.698 0.596 0.293 0.172 −0.978 0.027 Methylparaben 152.14  7  298.15 0.479 0.484 1.218 0.683 −2.103 0.120 Testosterone 288.41  7  298.15 1.051 0.771 0.233 0.669 −3.797 0.334 Theophylline 180.18  7  298.15 0.000 0.757 1.208 0.341 −6.110 0.661 Estriol 288.38  9^(a) 298.15 0.853 0.000 0.291 1.928 −7.652 0.608 Estrone 270.37 12  298.15 0.499 0.679 1.521 0.196 −6.531 0.519 Morphine 285.34  6  308.15 0.773 0.000 0.000 1.811 −4.658 1.007 Piroxicam 331.35 14^(b) 298.15 0.665 0.000 1.803 0.169 −7.656 0.665 Hydrocortisone 362.46 11^(c) 298.15 0.401 0.970 1.248 0.611 −6.697 0.334 Haloperidol 375.86 13^(d) 298.15 0.827 0.000 0.000 0.131 −4.398 0.311 ^(a)With THF excluded. ^(b)With 1,2 dichloroethane, chloroform, diethyl ether, and DMF excluded. ^(c)With hexane excluded. ^(d)With chloroform and DMF excluded. K_(sp), the solubility product constant, corresponds to the ideal solubility (in mole fraction) for the solute. The quality of the fit reflects both the effectiveness of the NRTL-SAC model and the quality of the molecular parameters identified from the limited available experimental data for the solvents.

FIGS. 8, 9, 10, 11, 12, 13, 14 and 15 include graphs illustrating the experimental solubilities vs. calculated solubilities for p-aminobenzoic acid, benzoic acid, camphor, ephedrine, lidocaine, methylparaben, testosterone, and theophylline, respectively, in various solvents at 298.15K. The various solvents used were selected from a group of 33 solvents, including acetic acid, acetone, benzene, 1-butanol, n-butyl acetate, carbon tetrachloride, chlorobenzene, chloroform, cyclohexane, 1,2-dichloroethane, dichloromethane, 1,2-dimethoxyethane, N,N-dimethylformamide, dimethyl-sulfoxide, 1,4-dioxane, ethanol, 2-ethoxyethanol, ethyl acetate, ethylene glycol, diethyl ether, formamide, n-heptane, n-hexane, isopropyl acetate, methanol, methyl acetate, 1-pentanol, 1-propanol, isopropyl alcohol, teterhydrofuran, toluene, water, and 1-octanol. The experimental solubility data was represented well with the NRTL-SAC model.

FIG. 16 includes a graph illustrating the experimental solubilities vs. calculated solubilities for estriol in 9 solvents at 298.15K. The experimental solubility data was represented well with the NRTL-SAC model. The data for tetrahydrofuran is found to be a very significant outlier and it is not included in the 9 solvents shown in FIG. 16.

FIG. 17 includes a graph illustrating the experimental solubilities vs. calculated solubilities for estrone in various solvents at 298.15K. The experimental solubility data was represented well with the NRTL-SAC model.

FIG. 18 includes a graph illustrating the experimental solubilities vs. calculated solubilities for morphine in 6 solvents at 308.15K. Cyclohexane and hexane were outliers. They are very low solubility solvents for morphine and the quality of the data is possibly subject to larger uncertainties.

FIG. 19 illustrates a graph of the experimental solubilities vs. calculated solubilities for piroxicam in 14 solvents at 298.15K. 1,2-dichloroethane, chloroform, diethyl ether, and N,N-dimethylformamide (DMF) were found to be major outliers and are not included in the 14 solvents shown in FIG. 19. Interestingly, Bustamante, et al. also reported 1,2-dichloroethane, chloroform, and diethyl ether as outliers in their study based on solubility parameter models. P. Bustamante, et al., Partial Solubility Parameters of Piroxicam and Niflumic Acid, 1998 INT. J. OF PHARM. 174, 141.

FIG. 20 illustrates a graph of the experimental solubilities vs. calculated solubilities for hydrocortisone in 11 solvents at 298.15K. Hexane is excluded because of the extreme low solubility of hydrocortisone in hexane which could possibly subject the data to larger uncertainty.

FIG. 21 illustrates a graph of the experimental solubilities vs. calculated solubilities for haloperidol in 13 solvents at 298.15K. Haloperidol showed unusually high solubilities in chloroform and DMF and these two solvents are not included in the 13 solvents.

The average RMS error on ln x for the predictions vs. experimental solubility data in Table 4 is 0.37. This corresponds to about ±45% accuracy in solubility predictions.

EXAMPLE 3 Comparison of NRTL-SAC Model to Prior Art Methods for Pharmaceutical Components

The solubilities of various pharmaceutical compounds was modeled with the NRTL-SAC approach of the present invention as well as some prior art models (e.g., the Hanson model and the UNIFAC model) to compare their relative accuracies. The pharmaceutical compounds used included VIOXX®, ARCOXIA®, Lovastatin, Simvastatin, FOSAMAX®. (Available from Merck & Co., Inc., Whitehouse Station, N.J.). The solvents used included water, N,N-Dimethylformamide (“DMF”), 1-propanol, 2-propanol, 1-butanol, toluene, Chloro-benzene, acetonitrile, ethyl acetate, methanol, ethanol, heptane, acetone, and triethylamine (TEA).

Saturated solutions of the compounds in the solvents were allowed to equilibrate for at least 48 hours. Supernatant fluid was filtered and diluted, and a high pressure liquid chromatography (HPLC) concentration analysis was performed to compare the predicted solubility values with actual solubility values.

The NRTL-SAC model of the present invention gave a RMS error on ln x of about 0.5 (i.e., an accuracy and predictive capability of ±˜50%), while the Hansen model had a RMS error on ln x of more than 0.75 and the UNIFAC model had a RMS error on ln x of more than 1.75. Additional comparisons were made for dual-solvent/pharmaceutical systems, and acceptable predictions were obtained from the NRTL-SAC model of the present invention.

These experiments show that the NRTL-SAC model is a simple correlative activity coefficient equation that requires only component-specific molecular parameters (i.e., numbers and types of conceptual segments). Conceptually, the approach suggests that a practitioner account for the liquid ideality of both small solvent molecules and complex pharmaceutical molecules in terms of component-specific molecular parameters (e.g., hydrophobicity, polarity, and hydrophilicity). In practice, these molecular parameters become the adjustable parameters that are determined from selected experimental data. With the development of molecular parameters for solvents and organic solutes, engineering calculations can be performed for various phase equilibrium studies, including solubilities in solvents and solvent mixtures for solvent selection. The NRTL-SAC model provides good qualitative representation on phase behaviors of organic solvents and their complex pharmaceutical solutes and it offers a practical predictive methodology for use in pharmaceutical process design.

EXAMPLE 4 NRTL Segment Activity Coefficient Model

The NRTL-SAC activity coefficient model for component I is composed of the combinatorial term γ_(I) ^(C) and the residual term γ_(I) ^(R): ln γ_(I)=ln γ_(I) ^(C)+ln γ_(I) ^(R)  (14) Here the combinatorial term γ_(I) ^(C) is calculated from the Flory-Huggins equation for the combinatorial entropy of mixing. The residual term γ_(I) ^(R) is calculated from the local composition (lc) interaction contribution γ_(I) ^(lc) of Polymer NRTL (Chen, C.-C., “A Segment-Based Local Composition Model for the Gibbs Energy of Polymer Solutions,” Fluid Phase Equilibria, 83:301, 1993) (herein “Chen 1993”). The Polymer NRTL equation incorporates the segment interaction concept and computes activity coefficient for component I in a solution by summing up contributions to activity coefficient from all segments that make up component I. The equation is given as follows:

$\begin{matrix} {\mspace{79mu}{{{\ln\;\gamma_{I}^{R}} = {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{m}{r_{m,I}\left\lfloor {{\ln\;\Gamma_{m}^{lc}} - {\ln\;\Gamma_{m}^{{lc},I}}} \right\rfloor}}}}\mspace{20mu}{with}}} & (15) \\ {{\ln\;\Gamma_{m}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime}}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{j}{x_{j}G_{{jm}^{\prime}}\tau_{{jm}^{\prime}}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}} \right)}}}} & (16) \\ {{{\ln\;\Gamma_{m}^{{lc},I}} = {\frac{\sum\limits_{j}{x_{j,I}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k,I}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime},I}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{j}{x_{j,I}G_{{jm}^{\prime}}\tau_{{jm}^{\prime}}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}} \right)}}}}\mspace{25mu}} & (17) \\ {\mspace{79mu}{x_{j} = {{\frac{\sum\limits_{I}{x_{I}r_{j,I}}}{\sum\limits_{I}{\sum\limits_{i}{x_{I}r_{i,I}}}}\mspace{20mu} x_{j,I}} = \frac{r_{j,I}}{\sum\limits_{j}r_{j,I}}}}} & (18) \end{matrix}$ where I is the component index, i, j, k, m, m′ are the segment species index, x_(I) is the mole fraction of component I, x_(j) is the segment-based mole fraction of segment species j, r_(m,I) is the number of segment species m contained only in component I, Γ_(m) ^(lc) is the activity coefficient of segment species m, and Γ_(m) ^(lc,I) is the activity coefficient of segment species m contained only in component I. G and τ in Eqs. 16 and 17 are local binary quantities related to each other by the NRTL non-random factor parameter α: G=exp(−ατ).  (19)

Four pre-defined conceptual segments were suggested by Chen and Song (2004 above and in parent patent application): one hydrophobic (x), two polar (y− and y+), and one hydrophilic (z). The model molecular parameters, i.e., hydrophobicity X, polarity types Y− and Y+, and hydrophilicity Z, correspond to r_(m,I) (m=x, y−, y+, z), numbers of various conceptual segments in component I.

In the notation used throughout this disclosure, subscript I (upper case) refers to components while subscript i (lower case) refers to segments.

EXAMPLE 5 eNRTL Segment Activity Coefficient Model

The extension of NRTL-SAC model for electrolytes is based on the generalized eNRTL model as summarized by Chen and Song (Chen, C.-C. and Y. Song, “Generalized Electrolyte NRTL Model for Mixed-Solvent Electrolyte Systems,” AIChE J., 50:1928, 2004b; herein incorporated by reference) (herein “Chen, 2004b). Here Applicants briefly present the generalized eNRTL model followed by details of the extended NRTL-SAC model of the present invention.

The generalized eNRTL model is applied to correlate mean ionic activity coefficient of mixed-solvent electrolyte systems. The segment interaction concept provides the framework to explicitly account for the attractive interaction of ions with the hydrophilic segments of organic solvents and the repulsive interaction of ions with the hydrophobic segments of organic solvents. In the generalized eNRTL model, any component, electrolyte or solvent, can be defined as an oligomer consisting of various segment species. For instance, an organic electrolyte species can be defined as an oligomer consisting of cationic segment, anionic segment and molecular segment. An organic solvent can be also defined as an oligomer consisting of multiple molecular segments of different nature. Accordingly, with the conventional activity coefficient accounting for the local interaction (Chen, 1993) and the long-range interaction, the model that uses the unsymmetric Pitzer-Debye-Hückel (PDH) formula (Pitzer, K. S., “Electrolytes: From Dilute Solutions to Fused Salts,” J. Am. Chem. Soc., 102, 2902 (1980)) (herein “Pitzer, 1980”) is calculated as follows:

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{*}} = {\frac{1}{RT}\left( \frac{\partial G_{m}^{*{ex}}}{\partial n_{I}} \right)_{T,P,n_{i \neq j}}}} \\ {= {{\frac{1}{RT}\left( \frac{\partial G_{m}^{{*{ex}},{lc}}}{\partial n_{I}} \right)_{T,P,n_{i \neq j}}} + {\frac{1}{RT}\left( \frac{\partial G_{m}^{{*{ex}},{PDH}}}{\partial n_{I}} \right)_{T,P,n_{i \neq j}}}}} \end{matrix} & (20) \end{matrix}$ or ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)  (21) where I is the component index, “*” denotes the unsymmetric convention, γ_(I) is the activity coefficient of the component I in the mixture; R is the gas constant; T is the temperature; P is the pressure; and n_(I) is the mole number of the component I in the mixture. The unsymmetric PDH formula, G*_(m) ^(ex,PDH), is obtained by normalization to mole fractions of unity for solvents and zero for electrolytes (Pitzer, K. S., “Thermodynamics of Electrolytes. I: Theoretical and General Equations,” J. Phys. Chem., 77, 268 (1973)). The local interaction NRTL model, G_(m) ^(ex,lc), is developed as a symmetric model (Chen, C.-C., “A Segment-Based Local Composition Model for the Gibbs Energy of Polymer Solutions,” Fluid Phase Equilib., 83, 301 (1993); and Chen, C.-C., C. P. Bokis, and P. M. Mathias, “A Segment-Based Excess Gibbs Energy Model for Aqueous Organic Electrolyte Systems,” AIChE J., 47, 2593 (2001)), based on the symmetrical reference state so that the derived activity coefficient, is γ_(I) ^(lc)=1 as x_(I)→1 for any component (species). The model is then normalized by the unsymmetric reference state (that is, the infinite-dilution activity coefficient in an aqueous or mixed-solvent solution) to obtain the unsymmetric model G*_(m) ^(ex,lc). Accordingly, the unsymmetric convention activity coefficient is calculated as follows:

$\begin{matrix} {{{\ln\;\gamma_{I}^{*{lc}}} = {{\ln\;\gamma_{I}^{lc}} - {\ln\;\gamma_{I}^{\infty\;{lc}}}}}{{{\ln\;\gamma_{I}^{*{lc}}} = {{{\ln\;\gamma_{I}^{lc}} - {\ln\;\gamma_{I}^{\infty\;{lc}}\ln\;\gamma_{I}^{\;{lc}}}} = {\frac{1}{RT}\left( \frac{\partial G_{m}^{{*{ex}},{lc}}}{\partial n_{I}} \right)_{T,P,n_{i \neq j}}}}},}} & (22) \\ {\gamma_{I}^{\infty}{{{\ln\;\gamma_{I}^{lc}} = {\frac{1}{RT}\left( \frac{\partial G_{m}^{{ex},{lc}}}{\partial n_{I}} \right)_{T,P,n_{i \neq j}}}},}} & (23) \end{matrix}$ where γ_(I) ^(∞) is the infinite-dilution activity coefficient of the ionic component I in an aqueous or mixed-solvent solution as calculated by Equation 23. A more detailed description on the generalized electrolyte-NRTL model is depicted in Chen, 2004b.

This generalized segment interaction concept is advantageous when one must exactly account for the different interaction characteristics that may be attributed to different molecules, solvents or solutes. The ability to exactly account for such different segment-segment interactions between different species in a system is shown to be key for quantitative correlation of mean ionic activity coefficients in mixed-solvent electrolyte systems. In the generalized eNRTL model, however, it is necessary to account for an electrolyte segment for each and every species separately. Therefore, in a system that involves multiple components, there could be tens of different segments to consider and hundreds of segment-segment interactions to account for, and the computation for activity coefficients becomes much more complicated.

Derived from and improved upon the generalized eNRTL model, the electrolyte extension of NRTL model of the present invention provides one conceptual electrolyte segment. A “conceptual electrolyte segment” herein is one predefined electrolyte segment that characterizes the prominent interaction mechanisms between molecules in the liquid phase, that account for the liquid phase nonideality. This pre-defined electrolyte segment is used as a reference against which all electrolyte segments are measured in terms of their liquid phase interaction characteristics. Unlike the generalized eNRTL model, which has no such “conceptual electrolyte segment” as a reference point, surface interaction characteristics of electrolyte segments of the present invention are normalized against the “conceptual electrolyte segment” (in a preferred embodiment, one with interaction characteristics of NaCl) and mathematically expressed as an equivalent number of the reference one. Having a point of reference for the calculation of the electrolyte segment provides a unified and consistent description of liquid phase nonideality of all electrolyte segments and a more intuitive and powerful predictive tool in modeling physical properties including solubility. Together with the numbers of “conceptual” hydrophobic segment, hydrophilic segment and polar segment, the number of “conceptual electrolyte segment” reflects the nature of the surface interactions and their characteristic surface interaction areas that determine their phase behavior.

In the simplest case of a strong electrolyte CA, one may use the following chemical reaction to describe the complete dissociation of the electrolyte: CA→υ _(C) C ^(Z) ^(C) +υ_(A) A ^(Z) ^(A)   (24) with υ_(C) Z _(C)=υ_(A) Z _(A)  (25) where υ_(C) is the cationic stoichiometric coefficient, υ_(A) is the anionic stoichiometric coefficient, Z_(C) is the absolute charge number for cation C, and Z_(A) is the absolute charge number for anion A.

In applying the segment contribution concept to electrolytes, Applicants introduce a new conceptual electrolyte segment e. This conceptual segment e would completely dissociate to a cationic segment (c) and an anionic segment (a), both of unity charge. Applicants then follow the like-ion repulsion and the electroneutrality constraints imposed by the generalized eNRTL model to derive the activity coefficient equations for ionic segments c and a. All electrolytes, organic or inorganic, symmetric or unsymmetric, univalent or multivalent, are to be represented with this conceptual uni-univalent electrolyte segment e together with previously defined hydrophobic segment, x, polar segments, y− and y+, and hydrophilic segment, z. Due to the fact that Applicants introduce only one (a universally useable one) conceptual electrolyte segment e, the resulting eNRTL-SAC model of the present invention is much simpler than the generalized eNRTL model proposed earlier.

EXAMPLE 6 Solubility of an Electrolyte

Described below is the solubility of an electrolyte by the expression:

$\begin{matrix} {{{K_{sp}(T)} = {\prod\limits_{C}\;{x_{C}^{v_{C},{SAT}}\gamma_{C}^{{*v_{C}},{SAT}}{\prod\limits_{A}\;{x_{A}^{v_{A},{SAT}}\gamma_{A}^{{*v_{A}},{SAT}}{\prod\limits_{M}\;{x_{M}^{SAT}\gamma_{M}^{SAT}}}}}}}},} & (26) \end{matrix}$ where Ksp is the solubility product constant for the electrolyte, T is the temperature of the mixture, x_(C) ^(ν) ^(C) ^(SAT) is the mole fraction of a cation derived from the electrolyte at saturation point of the electrolyte, x_(A) ^(ν) ^(A) ^(SAT) is the mole fraction of an anion derived from the electrolyte at saturation point of the electrolyte, x_(M) ^(ν) ^(M) ^(SAT) is the mole fraction of a neutral molecule derived from the electrolyte at saturation point of the electrolyte, γ*_(C) ^(ν) ^(C) ^(,SAT) is the activity coefficient of a cation derived from the electrolyte at the saturation concentration, γ*_(A) ^(ν) ^(A) ^(, SAT) is the activity coefficient of an anion derived from the electrolyte at the saturation concentration, γ*_(M) ^(ν) ^(M) ^(SAT) is the activity coefficient of a neutral molecule derived from the electrolyte at the saturation concentration, C is the cation, A is the anion, M is solvent or solute molecule, T is the temperature of the mixture, γ* is the unsymmetric activity coefficient of a species in solution, SAT is saturation concentration, υ_(C) is the cationic stoichiometric coefficient, υ_(A) is the anionic stoichiometric coefficient, and υ_(M) is the neutral molecule stoichiometric coefficient.

A major consideration in the extension of NRTL-SAC for electrolytes is the treatment of reference state for activity coefficient calculations. While the conventional reference state for nonelectrolyte systems is the pure liquid component, the conventional reference state for electrolytes in solution is the infinite-dilution aqueous solution and the corresponding activity coefficient is “unsymmetric.”

Following the generalized eNRTL model, the logarithm of unsymmetric activity coefficient of an ionic species, ln γ*_(I), is the sum of three terms: the local composition term, ln γ*_(I) ^(lc), the Pitzer-Debye-Hückel term, ln γ*_(I) ^(PDH), and the Flory-Huggins term, ln γ*_(I) ^(FH). ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)+ln γ*_(I) ^(FH)  (27) Eq. 27 applies to aqueous electrolyte systems where water is a sole solvent within the solution. For mixed-solvent solutions, the Born term, Δ ln γ_(I) ^(Born), is used to correct the change of the infinite dilution reference state from the mixed-solvent composition to the aqueous solution for the Pitzer-Debye-Hückel term: ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)+ln γ*_(I) ^(FH)+Δ ln γ^(Born)  (28) Since Applicants adopt the aqueous phase infinite dilution reference state for γ*_(I), the Born term correction is required for non-aqueous systems.

With the introduction of the conceptual electrolyte segment e and the corresponding conceptual ionic segments c and a, one can rewrite Eq. 28 in terms of contributions from all conceptual segments:

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{*}} = {{\ln\;\gamma_{I}^{*{lc}}} + {\ln\;\gamma_{I}^{*{PDH}}} + {\ln\;\gamma_{I}^{*{FH}}} + {{\Delta ln}\;\gamma_{I}^{Born}}}} \\ {= {{\sum\limits_{m}\;{r_{m,I}\left( {{\ln\;\Gamma_{m}^{*{lc}}} + {\ln\;\Gamma_{m}^{*{PDH}}}} \right)}} +}} \\ {{r_{c,I}\left( {{\ln\;\Gamma_{c}^{*{lc}}} + {\ln\;\Gamma_{c}^{*{PDH}}} + {{\Delta ln}\;\Gamma_{c}^{Born}}} \right)} +} \\ {{r_{a,I}\left( {{\ln\;\Gamma_{a}^{*{lc}}} + {\ln\;\Gamma_{a}^{*{PDH}}} + {{\Delta ln}\;\Gamma_{a}^{Born}}} \right)} + {\ln\;\gamma_{I}^{*{FH}}}} \end{matrix} & (29) \end{matrix}$ where r is the segment number, m is the conceptual molecular segment index (i.e., m=x, y−, y+, z), c and a are cationic and anionic segments, respectively, resulting from the dissociation of the conceptual electrolyte segment e. Also notice that in Eq. 29, unlike the local composition term and the long range ion-ion interaction terms, the Flory-Huggins term remains as the component-based contribution.

For systems of single electrolyte CA with a segment number r_(e), r_(c) and r_(a) must satisfy electroneutrality and they can be computed from r_(e), Z_(C), and Z_(A). r _(c,C) =r _(e,CA) Z _(C)  (30) r _(a,A) =r _(e,CA) Z _(A)  (31)

For systems of multiple electrolytes, the mixing rule is needed to compute segment number r_(c) and r_(a) for each cation C and anion A.

$\begin{matrix} {r_{c,C} = {\sum\limits_{A}\;{r_{e,{CA}}{Z_{C}\left( {x_{A}{Z_{A}/{\sum\limits_{A^{\prime}}\;{x_{A^{\prime}}Z_{A^{\prime}}}}}} \right)}}}} & (32) \\ {r_{a,A} = {\sum\limits_{C}\;{r_{e,{CA}}{Z_{A}\left( {x_{C}{Z_{C}/{\sum\limits_{C^{\prime}}\;{x_{C^{\prime}}Z_{C^{\prime}}}}}} \right)}}}} & (33) \end{matrix}$

r_(e,CA), the number of conceptual electrolyte segment e in electrolyte CA, becomes the new model parameter for electrolytes. For the sake of brevity, Applicants call r_(e,CA) parameter E, the electrolyte segment number.

EXAMPLE 7 Local Composition Interaction Contribution

To derive the expression for the local composition interaction contribution, Applicants simplify the generalized excess Gibbs energy expression of the prior Chen and Song model (Chen, 2004b) for systems with multiple molecular segments m and single electrolyte segment e. The single electrolyte segment e is then decomposed into a cationic segment c and an anionic segment a:

$\begin{matrix} {{\frac{G^{{ex},{lc}}}{nRT} = {\sum\limits_{I}\;\begin{bmatrix} {{\sum\limits_{m}\;{r_{m,I}{x_{I}\left( \frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} \right)}}} +} \\ {{r_{c,I}{x_{I}\left( \frac{\sum\limits_{j}{x_{j}G_{{jc},{ac}}\tau_{{jc},{ac}}}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}} \right)}} + {r_{a,I}{x_{I}\left( \frac{\sum\limits_{j}{x_{j}G_{{ja},{ca}}\tau_{{ja},{ca}}}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}} \right)}}} \end{bmatrix}}}\mspace{160mu}{with}} & (34) \\ {\mspace{79mu}{{x_{j} = {\frac{\sum\limits_{I}{x_{I}r_{j,I}}}{\sum\limits_{I}{\sum\limits_{i}{x_{I}r_{i,I}}}}\mspace{14mu} i}},{j = m},c,a}} & (35) \end{matrix}$ where G^(ex,lc) is the excess Gibbs energy from local composition interactions, n is the total mole number, R is the gas constant and T is the temperature.

To derive the segment activity coefficient, one can rewrite Eq. 34 as follows:

$\begin{matrix} {\frac{G^{{ex},{lc}}}{n_{S}{RT}} = {{\sum\limits_{m}{x_{m}\left( \frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} \right)}} + {x_{c}\left( \frac{\sum\limits_{j}{x_{j}G_{{jc},{ac}}\tau_{{jc},{ac}}}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}} \right)} + {x_{a}\left( \frac{\sum\limits_{j}{x_{j}G_{{ja},{ca}}\tau_{{ja},{ca}}}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}} \right)}}} & (36) \end{matrix}$ where n_(S) is the total number of all segments. Accordingly, the segment activity coefficient can be calculated as follows:

$\begin{matrix} {{{\ln\;\Gamma_{j}^{lc}} = {\frac{1}{RT}\left( \frac{\partial G^{{ex},{lc}}}{\partial n_{j}} \right)_{T,P,n_{i \neq j}}\mspace{14mu} i}},{j = m},c,a} & (37) \end{matrix}$ Specifically, the activity coefficients from Eq. 37 for molecular segments, cationic segment, and anionic segment can be carried out as follows:

$\begin{matrix} {{\ln\;\Gamma_{m}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime}}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{j}{x_{j}G_{{jm}^{\prime}}\tau_{{jm}^{\prime}}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}} \right)}} + {\frac{x_{c}G_{{mc},{ac}}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}}\left( {\tau_{{mc},{ac}} - \frac{\sum\limits_{j}{x_{j}G_{{jc},{ac}}\tau_{{jc},{ac}}}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}}} \right)} + {\frac{x_{a}G_{{ma},{ca}}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}}\left( {\tau_{{ma},{ca}} - \frac{\sum\limits_{j}{x_{j}G_{{ja},{ca}}\tau_{{ja},{ca}}}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}}} \right)}}} & (38) \\ {{\ln\;\Gamma_{c}^{lc}} = {{\sum\limits_{m}{\frac{x_{m}G_{cm}}{\sum\limits_{k}\;{x_{k}G_{km}}}\left( {\tau_{cm} - \frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}}} \right)}} + \frac{\sum\limits_{j}{x_{j}G_{{jc},{ac}}\tau_{{jc},{ac}}}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}} - {\frac{x_{a}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}}\left( \frac{\sum\limits_{j}{x_{j}G_{{ja},{ca}}\tau_{{ja},{ca}}}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}} \right)}}} & (39) \\ {{\ln\;\Gamma_{a}^{lc}} = {{\sum\limits_{m}{\frac{x_{m}G_{am}}{\sum\limits_{k}\;{x_{k}G_{km}}}\left( {\tau_{am} - \frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}}} \right)}} + \frac{\sum\limits_{j}{x_{j}G_{{ja},{ca}}\tau_{{ja},{ca}}}}{\sum\limits_{k}{x_{k}G_{{ka},{ca}}}} - {\frac{x_{c}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}}\left( \frac{\sum\limits_{m}{x_{m}G_{{mc},{ac}}\tau_{{mc},{ac}}}}{\sum\limits_{k}{x_{k}G_{{kc},{ac}}}} \right)}}} & (40) \end{matrix}$

The local composition term for the logarithm of activity coefficient of component I is computed as the sum of the individual segment contributions.

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{i}{r_{i,I}\ln\;\Gamma_{i}^{lc}}}} & {{i = m},c,a} \\ {{\sum\limits_{m}{r_{m,I}\ln\;\Gamma_{m}^{lc}}} + {r_{c,I}\ln\;\Gamma_{c}^{lc}} + {r_{a,I}\ln\;\Gamma_{a}^{lc}}} &  \end{matrix} & (41) \end{matrix}$

However, the activity coefficient by Eq. 41 needs to be further normalized so that γ_(I) ^(lc)=1 as x_(I)→1 for any component; this is the so-called symmetric reference state. The normalization can be done as follows:

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{i}{r_{i,I}\left\lbrack {{\ln\;\Gamma_{i}^{lc}} - {\ln\;\Gamma_{i}^{{lc},I}}} \right\rbrack}}} & {{i = m},c,a} \\ {= {{\sum\limits_{m}{r_{m,I}\left\lbrack {{\ln\;\Gamma_{m}^{lc}} - {\ln\;\Gamma_{m}^{{lc},I}}} \right\rbrack}} +}} & \\ {{r_{c,I}\left\lbrack {{\ln\;\Gamma_{c}^{lc}} - {\ln\;\Gamma_{c}^{{lc},I}}} \right\rbrack} +} & \\ {r_{a,I}\left\lbrack {{\ln\;\Gamma_{a}^{lc}} - {\ln\;\Gamma_{a}^{{lc},I}}} \right\rbrack} &  \end{matrix} & (42) \end{matrix}$ Here Γ_(i) ^(lc,I) is the activity coefficient of the segment i contained in the symmetric reference state of component I; it can be calculated from Eqs. 38-40 by setting x₁=1: ln Γ_(i) ^(lc,I)=ln Γ_(i) ^(lc)(x _(I)=1) i=m,c,a  (43)

Finally, the unsymmetric convention in Eq. 28 requires us to compute the infinite-dilution activity coefficient, γ_(I) ^(∞lc), for a component: ln γ*_(I) ^(lc)=ln γ_(I) ^(lc)−ln γ_(I) ^(∞lc)  (44) with

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{\infty\;{lc}}} = {\sum\limits_{i}{r_{i,I}\left\lbrack {{\ln\;\Gamma_{i}^{\infty\;{lc}}} - {\ln\;\Gamma_{i}^{{lc},I}}} \right\rbrack}}} & {{i = m},c,a} \\ {= {{\sum\limits_{m}{r_{m,I}\left\lbrack {{\ln\;\Gamma_{m}^{\infty\;{lc}}} - {\ln\;\Gamma_{m}^{{lc},I}}} \right\rbrack}} +}} & \\ {{r_{c,I}\left\lbrack {{\ln\;\Gamma_{c}^{\infty\;{lc}}} - {\ln\;\Gamma_{c}^{{lc},I}}} \right\rbrack} -} & \\ {r_{a,I}\left\lbrack {{\ln\;\Gamma_{a}^{\infty\;{lc}}} - {\ln\;\Gamma_{a}^{{lc},I}}} \right\rbrack} &  \end{matrix} & (45) \end{matrix}$ Combining Eqs. 42 and 45, one can obtain:

$\begin{matrix} \begin{matrix} {{\ln\;\gamma_{I}^{*{lc}}} = {{\ln\;\gamma_{I}^{lc}} - {\ln\;\gamma_{I}^{\infty\;{lc}}}}} & {{i = m},c,a} \\ {= {\sum\limits_{i}{r_{i,I}\left\lbrack {{\ln\;\Gamma_{i}^{lc}} - {\ln\;\Gamma_{i}^{\infty\;{lc}}}} \right\rbrack}}} & \\ {= {{\sum\limits_{m}{r_{m,I}\left\lbrack {{\ln\;\Gamma_{m}^{lc}} - {\ln\;\Gamma_{m}^{\infty\;{lc}}}} \right\rbrack}} +}} & \\ {{r_{c,I}\left\lbrack {{\ln\;\Gamma_{c}^{lc}} - {\ln\;\Gamma_{c}^{\infty\;{lc}}}} \right\rbrack} +} & \\ {r_{a,I}\left\lbrack {{\ln\;\Gamma_{a}^{lc}} - {\ln\;\Gamma_{a}^{\infty\;{lc}}}} \right\rbrack} & \\ {= {{\sum\limits_{m}{r_{m,I}\ln\;\Gamma_{m}^{*{lc}}}} + {r_{c,I}\ln\;\Gamma_{c}^{*{lc}}} + {r_{a,I}\ln\;\Gamma_{a}^{*{lc}}}}} &  \end{matrix} & (46) \end{matrix}$ with ln Γ*_(m) ^(lc)=ln Γ_(m) ^(lc)−ln Γ_(m) ^(∞lc)  (47) ln Γ*_(c) ^(lc)=ln Γ_(c) ^(lc)−ln Γ_(c) ^(∞lc)  (48) ln Γ*_(a) ^(lc)=ln Γ_(a) ^(lc)−ln Γ_(a) ^(∞lc)  (49)

Because Applicants adopt the aqueous phase infinite dilution reference state, the infinite-dilution activity coefficients of conceptual segments can be calculated from Eqs. 38-41 by setting x_(W)=1: ln Γ_(i) ^(∞lc)=ln Γ_(i) ^(lc)(x _(W)=1) i=m,c,a  (50) where x_(W) is the mole fraction of water in the solution.

EXAMPLE 8 Long-Range Interaction Contribution from Pitzer-Debye-Hückel (PDH) Model

To account for the long-range ion-ion interactions, the present invention eNRTL-SAC model uses the unsymmetric Pitzer-Debye-Hückel (PDH) formula (Pitzer, 1980) on the segment basis:

$\begin{matrix} {{\frac{G^{{*{ex}},{PDH}}}{n_{S}{RT}} = {{- \left( \frac{1000}{\overset{\_}{M_{S}}} \right)^{\frac{1}{2}}}\left( \frac{4\; A_{\varphi}I_{x}}{\rho} \right){\ln\left( {1 + {\rho\; I_{x}^{\frac{1}{2}}}} \right)}}}{with}} & (51) \\ {A_{\varphi} = {\frac{1}{3}\left( \frac{2\pi\; N_{A}{\overset{\_}{d}}_{S}}{1000} \right)^{\frac{1}{2}}\left( \frac{Q_{e}^{2}}{{\overset{\_}{ɛ}}_{S}k_{B}T} \right)^{\frac{3}{2}}}} & (52) \\ {I_{x} = {\frac{1}{2}{\sum\limits_{i}\;{x_{i}z_{i}^{2}}}}} & (53) \end{matrix}$ where A_(φ) is the Debye-Hückel parameter, I_(x) is the ionic strength (segment mole fraction scale), M _(S) is the average molecular weight of the mixed-solvents, ρ is the closest approach parameter, N_(A) is the Avogadro's number, d _(S) is the average density of the mixed-solvents, Q_(e) is the electron charge, ∈ _(S) is the average dielectric constant of the mixed-solvents, k_(B) is the Boltzmann constant, and z_(i) (z_(m)=0; z_(c)=z_(a)=1) is the charge number of segment-based species i.

Applying the PDH model to the conceptual segments, the activity coefficient of segment species i can be derived as follows:

$\begin{matrix} \begin{matrix} {{\ln\;\Gamma_{i}^{*{PDH}}} = {\frac{1}{RT}\left( \frac{\partial G^{{*{ex}},{PDH}}}{\partial n_{i}} \right)_{T,P,n_{j \neq i}}}} & {i,{j = m},c,a} \\ {= {{- \left( \frac{1000}{\overset{\_}{M_{S}}} \right)^{\frac{1}{2}}}{A_{\varphi}\left\lbrack {{\left( \frac{2\; z_{i}^{2}}{\rho} \right){\ln\left( {1 + {\rho\; I_{x}^{\frac{1}{2}}}} \right)}} + \frac{{z_{i}^{2}I_{x}^{\frac{1}{2}}} - {2\; I_{x}^{\frac{3}{2}}}}{1 + {\rho\; I_{x}^{\frac{1}{2}}}}} \right\rbrack}}} & \; \end{matrix} & (54) \end{matrix}$

The unsymmetric long range term for the logarithm of activity coefficient of component I is the sum of contributions from its various segments:

$\begin{matrix} {\mspace{79mu}{{{\ln\;\gamma_{I}^{*{PDH}}} = {{\sum\limits_{m}{r_{m,I}\ln\;\Gamma_{m}^{*{PDH}}}} + {r_{c,I}\ln\;\Gamma_{c}^{*{PDH}}} + {r_{a,I}\ln\;\Gamma_{a}^{*{PDH}}}}}\mspace{79mu}{where}}} & (55) \\ {\mspace{79mu}{{\ln\;\Gamma_{m}^{*{PDH}}} = {2\left( \frac{1000}{\overset{\_}{M_{S}}} \right)^{\frac{1}{2}}\frac{A_{\varphi}I_{x}^{\frac{3}{2}}}{1 + {\rho\; I_{x}^{\frac{1}{2}}}}}}} & (56) \\ {{{\ln\;\Gamma_{c}^{*{PDH}}} = {{\ln\;\Gamma_{a}^{*{PDH}}} = {{- \left( \frac{1000}{\overset{\_}{M_{S}}} \right)^{\frac{1}{2}}}{A_{\varphi}\left\lbrack {{\left( \frac{2}{\rho} \right){\ln\left( {1 + {\rho\; I_{x}^{\frac{1}{2}}}} \right)}} + \frac{I_{x}^{\frac{1}{2}} - {2\; I_{x}^{\frac{3}{2}}}}{1 + {\rho\; I_{x}^{\frac{1}{2}}}}} \right\rbrack}}}}\mspace{79mu}{With}} & (57) \\ {\mspace{79mu}{A_{\varphi} = {\frac{1}{3}\left( \frac{2\pi\; N_{A}{\overset{\_}{d}}_{S}}{1000} \right)^{\frac{1}{2}}\left( \frac{Q_{e}^{2}}{{\overset{\_}{ɛ}}_{S}k_{B}T} \right)^{\frac{3}{2}}}}} & (58) \\ {\mspace{79mu}{I_{x} = {\frac{1}{2}\left( {x_{c} + x_{a}} \right)}}} & (59) \end{matrix}$

The Debye-Hückel theory is based on the infinite dilution reference state for ionic species in the actual solvent media. For systems with water as the only solvent, the reference state is the infinite dilution aqueous solution. For mixed-solvent systems, the reference state for which the Pitzer-Debye-Hückel formula remains valid is the infinite dilution solution with the corresponding mixed-solvent composition. Consequently, the molecular quantities for the single solvent need to be extended for mixed-solvents; simple composition average mixing rules are adequate to calculate them as follows:

$\begin{matrix} {{\overset{\_}{M}}_{S} = {\sum\limits_{S}\;{x_{S}^{\prime}M_{S}}}} & (60) \\ {\frac{1}{{\overset{\_}{d}}_{S}} = {\sum\limits_{S}\frac{x_{S}^{\prime}}{d_{S}}}} & (61) \\ {{{\overset{\_}{ɛ}}_{S} = {\sum\limits_{S}{w_{S}^{\prime}ɛ_{S}}}}{with}} & (62) \\ {x_{S}^{\prime} = \frac{x_{S}}{\sum\limits_{S}x_{S}}} & (63) \\ {w_{S}^{\prime} = \frac{M_{s}x_{s}}{\sum\limits_{S}{M_{S}x_{S}}}} & (64) \end{matrix}$ where S is a solvent component in the mixture, and M_(S) is the molecular weight of the solvent S. It should be pointed out that Eqs. 60-64 should be used only in Eq. 54 and M _(S), d _(S), and ∈ _(S) were already assumed as constants in Eqs. 51 and 52 when deriving Eq. 54 for mixed-solvent systems. Table 1 shows the values of dielectric constant at 298.15 K used in this study for the same sixty-two solvents investigated by Chen and Song (Chen, 2004a and U.S. Publication No. 2005/0187748) above. These values were compiled from various sources including internet websites and commercial software Aspen Properties v2004.1 (by Aspen Technology, Inc. of Cambridge, Mass., assignee of the present invention).

EXAMPLE 9 Born Term Correction to Activity Coefficient

Given that the infinite dilution aqueous solution is chosen as the reference state, one needs to correct the change of the reference state from the mixed-solvent composition to aqueous solution for the Pitzer-Debye-Hückel term. The Born term (Robinson, R. A. and R. H. Stokes, Electrolyte Solutions, 2^(nd) ed., Butterworths (1970), Rashin, A. A. and B. Honig, “Reevaluation of the Born Model of Ion Hydration, J. Phys. Chem., 89: 5588 (1985)) on the segment basis is used for this purpose:

$\begin{matrix} {\frac{\Delta\; G^{Born}}{n_{S}{RT}} = {\frac{Q_{e}^{2}}{2\; k_{B}T}\left( {\frac{1}{{\overset{\_}{ɛ}}_{S}} - \frac{1}{ɛ_{W}}} \right){\sum\limits_{i}\;{\frac{x_{i}z_{i}^{2}}{r_{i}}10^{- 2}}}}} & (65) \end{matrix}$ ΔG^(Born) is the Born term correction to the unsymmetric Pitzer-Debye-Hückel formula, G*^(ex,PDH), ∈_(W) is the dielectric constant of water, and r_(i) is the Born radius of segment specie i.

Applying Eq. 65 to all conceptual segments, the corresponding expression for the activity coefficient of segment species i can be derived as follows:

$\begin{matrix} {\mspace{79mu}{{{{\Delta ln\Gamma}_{m}^{Born} = {{\frac{1}{RT}\left( \frac{{\partial\Delta}\; G^{Born}}{\partial n_{m}} \right)_{T,P,n_{j \neq m}}} = 0}}\mspace{79mu}{m = x}},{y -},{y +},z}} & (66) \\ {{{\Delta ln\Gamma}_{i}^{Born} = {{\frac{1}{RT}\left( \frac{{\partial\Delta}\; G^{Born}}{\partial n_{i}} \right)_{T,P,n_{j \neq i}}} = {\frac{Q_{e}^{2}}{2k_{B}T}\left( {\frac{1}{{\overset{\_}{ɛ}}_{S}} - \frac{1}{ɛ_{W}}} \right)\frac{z_{i}^{2}}{r_{i}}10^{- 2}}}}\mspace{79mu}{{i = c},a}} & (67) \end{matrix}$

The Born correction term on the logarithm of activity coefficient of component I is the sum of contributions from its various segments:

$\begin{matrix} {{\Delta\;\ln\;\gamma_{I}^{Born}} = {{r_{c,I}{\Delta ln\Gamma}_{c}^{Born}} + {r_{a,I}{\Delta ln\Gamma}_{a}^{Born}}}} & (68) \\ {{\Delta ln\Gamma}_{c}^{Born} = {\frac{Q_{e}^{2}}{2k_{B}T}\left( {\frac{1}{{\overset{\_}{ɛ}}_{S}} - \frac{1}{ɛ_{W}}} \right)\frac{1}{r_{c}}10^{- 2}}} & (69) \\ {{\Delta ln\Gamma}_{a}^{Born} = {\frac{Q_{e}^{2}}{2k_{B}T}\left( {\frac{1}{{\overset{\_}{ɛ}}_{S}} - \frac{1}{ɛ_{W}}} \right)\frac{1}{r_{a}}10^{- 2}}} & (70) \end{matrix}$

EXAMPLE 10 Flory-Huggins Term Correction to Activity Coefficient

Although in most common electrolyte systems, the combinatorial entropy of mixing term is much smaller than the residual term, one may still want to include it in a general model. Applicants follow the Polymer NRTL model (Chen 1993 above) and use the Flory-Huggins term to describe the combinatorial term:

$\begin{matrix} {{\frac{G^{{ex},{FH}}}{nRT} = {\sum\limits_{I}{x_{I}{\ln\left( \frac{\phi_{I}}{x_{I}} \right)}}}}{with}} & (71) \\ {\phi_{I} = \frac{x_{I}r_{I}}{\sum\limits_{J}{x_{J}r_{J}}}} & (72) \end{matrix}$ where G^(ex,FH) is the Flory-Huggins term for the excess Gibbs energy, φ_(I) is the segment fraction of component I, and r_(I) is the number of all conceptual segments in component I:

$\begin{matrix} {r_{I} = {{\sum\limits_{m}r_{m,I}} + r_{c,I} + r_{a,I}}} & (73) \end{matrix}$

The activity coefficient of component I from the combinatorial term can be derived from Eq. 60:

$\begin{matrix} {{\ln\;\gamma_{I}^{FH}} = {{{\ln\left( \frac{\phi_{I}}{x_{I}} \right)} + 1 - {r_{I}{\sum\limits_{J}\frac{\phi_{J}}{r_{J}}}}} = {{\ln\left( \frac{r_{I}}{\sum\limits_{J}{x_{J}r_{J}}} \right)} + 1 - \frac{r_{I}}{\sum\limits_{J}{x_{J}r_{J}}}}}} & (74) \end{matrix}$

The infinite-dilution activity coefficient of a component in water is:

$\begin{matrix} {{\ln\;\gamma_{I}^{\infty\;{FH}}} = {{\ln\left( \frac{r_{I}}{r_{W}} \right)} + 1 - \frac{r_{I}}{r_{W}}}} & (75) \end{matrix}$

In both NRTL-SAC (parent patent application) and present invention eNRTL-SAC, water is selected as the reference for the hydrophilic segment z. Therefore, one can set r_(W)=1. Thus, one has: ln γ_(I) ^(∞FH)=ln r_(I)+1−r _(I)  (76) One can then compute the unsymmetric activity coefficient from the Flory-Huggins term as follows:

$\begin{matrix} {{\ln\;\gamma_{I}^{*{,{FH}}}} = {{{\ln\;\gamma_{I}^{FH}} - {\ln\;\gamma_{I}^{\infty\;{FH}}}} = {r_{I} - {\ln\left( {\sum\limits_{J}{x_{J}r_{J}}} \right)} - \frac{r_{I}}{\sum\limits_{J}{x_{J}r_{J}}}}}} & (77) \end{matrix}$

EXAMPLE 11 NRTL Binary Parameters

In Eqs. 14 and 15 for NRTL-SAC, the model formulation requires the asymmetric interaction energy parameters, τ, and the symmetric nonrandom factor parameters, α, for each binary pair of the conceptual segments. In Eqs. 38-40 for eNRTL-SAC of the present invention, one needs additional binary parameters of τ and α between conceptual molecular segments, m and ionic segments, c or a. In practice, Applicants fix the values of α's for the binary pairs of molecular segment and ionic segment to the single value of 0.2 while the values of τ for the binary pairs of molecular segment and ionic segment are calculated from the τ's for the binary pairs of molecular segment and electrolyte segment. Following the same scheme in generalized eNRTL (Chen and Song, 2004b above), one can calculate these binary interaction energy parameters as follows: τ_(cm)=τ_(am)=τ_(em)  (78) τ_(mc,ac)=τ_(ma,ca)=τ_(me)  (79)

Following the treatment of NRTL-SAC (disclosed in U.S. Publication No. 2005/0187748), Applicants identify a reference electrolyte for the conceptual electrolyte segment e. In searching for the reference electrolyte, Applicants choose one elemental electrolyte that has abundant literature data. In one example study, NaCl is used as the reference electrolyte for e. The ionic radii for sodium ion and chloride ion are 1.680×10⁻¹⁰ m and 1.937×10⁻¹⁰m, respectively. With NaCl as the reference electrolyte, the energy parameters for the z-e pair are set to (8.885, −4.549) for the water-NaCl pair. The energy parameters for the x-e pair are set to (15, 5), in line with the parameters identified for C₂H₄—NaCl pair earlier by Chen and Song (Chen, 2004b). The energy parameters for the y-e pairs are set to (12, −3) after limited trials to optimize the performance of the model in this study. The complete set of NRTL binary interaction energy parameters are given in Table 6. Other choices of the reference electrolyte and parameter values may be suitable. The below reports the general behavior of the present invention eNRTL-SAC model based on the parameters reported in Table 6.

The electrolyte segment e is the only extra molecular descriptor and the electrolyte parameter E is the only extra molecular parameter for all electrolytes, inorganic or organic. All local and long range interactions derived from the existence of cationic and anionic species of various ionic charge valence, radius, chemical make-up, etc., are to be accounted for with this extra molecular descriptor for electrolytes together with combinations of conceptual molecular segments, i.e., hydrophobicity, polarity and hydrophilicity. In other words, every electrolyte, organic or inorganic, are modeled as combinations of E, X Y, and Z. As such, electrolytes are recognized as “hydrophobic” electrolytes, “polar” electrolytes, “hydrophilic” electrolytes, and their various combinations. Likewise, ionic activity coefficient of each ionic species will be computed from its share of E, X, Y, and Z. The ions are to be considered as “hydrophobic” ions, “polar” ions, or “hydrophilic” ions.

FIGS. 22 to 26 show effects of the molecular parameters on mean ionic activity coefficients (mole fraction scale) of the reference electrolyte, i.e., electrolyte with E=1. As shown in FIGS. 22 to 26, hydrophobicity parameter X brings down the mean ionic activity coefficient at low electrolyte concentration but in a rather nonlinear way. Polarity parameter Y− raises the mean ionic activity coefficient while polarity parameter Y+ lowers the mean ionic activity coefficient. Hydrophilicity parameter Z has a relatively slight downshift effect on the mean ionic activity coefficient. Electrolyte parameter E brings down the mean ionic activity coefficient at low electrolyte concentration and pushes up the mean ionic activity coefficient at high electrolyte concentration.

Experimental data for ionic activity coefficients are not readily available though emerging (Wilczek-Vera, G. et al, “On the Activity of Ions and the Junction Potential: Revised Values for All Data,” AIChE J., 50:445, 2004). Given the fact that existing experimental data are limited to mean ionic activity coefficient for neutral electrolytes, Applicants are not able to directly identify the molecular parameters for ionic species. In preparing FIGS. 22 to 26 discussed above and the subsequent studies reported in the Model Applications section below, Applicants use Eqs. 30-31 to determine from electrolyte parameter E the ionic segment numbers for the ions and Applicants arbitrarily assign molecular segment parameters (X, Y−, Y+, and Z) only to the anion. This practice is acceptable since virtually all electrolytes investigated in this study are electrolytes with elemental cations.

Limited amount of mean ionic activity coefficient data are available in the public literature for aqueous electrolytes. Applicants test the eNRTL-SAC model 20 as shown in FIG. 3 against mean ionic activity coefficient data of aqueous electrolyte systems. In addition, Applicants test the eNRTL-SAC model against salt solubility data in multiple solvents for a number of inorganic electrolytes and organic electrolytes. To the best of Applicants' knowledge, public literature data is very scarce for such salt solubility data. Proprietary solubility data from industrial collaborators was also used to test the applicability of the eNRTL-SAC model. However, results with such proprietary solubility data are not included in this discussion.

EXAMPLE 12 Mean Ionic Activity Coefficients in Aqueous Systems

For an electrolyte CA that dissociates to cation C and anion A, the mean ionic activity coefficients γ*_(±) is related to individual ionic activity coefficients as follows:

$\begin{matrix} {{\ln\;\gamma_{\pm}^{*}} = {\frac{1}{v}\left( {{v_{C}\ln\;\gamma_{C}^{*}} + {v_{A}\ln\;\gamma_{A}^{*}}} \right)}} & (80) \end{matrix}$ where ν=ν_(C)+ν_(A).

Equation 77 gives the mean ionic activity coefficient on the mole fraction scale and it can be converted to the molality scale: ln γ*_(±m)=ln γ*_(±)−ln(1+νmM_(S)/1000)  (81) where γ*_(±m) is the mean ionic activity coefficient on the molality scale, m is the molality of the salt (mol/kg-solvent), and M_(S) is the molecular weight of the solvent (g/mol).

Table 7 shows the fit to molality scale mean ionic activity coefficient data and the identified electrolyte and molecular parameters for the aqueous inorganic and organic electrolytes at 298.15 K as compiled by of Robinson and Stokes (1970) cited above. All mean ionic activity coefficient data are assumed to have standard deviation of 5%. The data for C5 and higher sodium carboxylates were excluded from the fit because these organic electrolytes were known to form micelles at high electrolyte concentrations (Chen, C.-C. et al., “Segment-Based Excess Gibbs Energy Model for Aqueous Organic Electrolytes, AIChE J., 47:2593, 2001). With a few exceptions such as LiBr, most uni-univalent and uni-bivalent electrolytes are well represented as combinations of E and Y− or Y+ parameters. Most uni-univalent electrolytes have E parameter around unity while higher E values are found for higher valent electrolytes. Applicants also found that the fit seems to deteriorate for electrolytes with higher E values. This observation is consistent with the understanding that higher valent electrolytes are known to prone to the formation of hydrated species or other complexation species. The relatively poor representation of these electrolytes with the model reflects the inadequate assumption of complete dissociation for such electrolytes (Chen, C.-C.; et al., “Unification of Hydration and Dissociation Chemistries with the Electrolyte NRTL Model,” AIChE Journal, 45:1576, 1999). As a derived property, mean ionic activity coefficient becomes meaningless if the complete dissociation assumption of electrolytes does not hold true.

To illustrate the quality of the fit, FIG. 27 shows the comparison of experimental and calculated molality scale mean ionic activity coefficients for five aqueous electrolytes at 298.15 K. The solid lines are the calculated values from the model. It shows that the eNRTL-SAC model provides reasonable qualitative representation of the data while the original eNRTL model (Chen, C.-C. et al., “Local Composition Model for Excess Gibbs Energy of Electrolyte Systems,” AIChE J., 28:588, 1982) achieves excellent quantitative representation of the data.

EXAMPLE 13 Salt Solubility in Mixed Solvent Systems

At the solubility limit of a nonelectrolytes, the solubility product constant, K_(sp), can be written in terms of the product of the solute concentration and the solute activity coefficient at the saturation concentration: K _(sp) =x _(I)γ_(I)  (82)

At the solubility limit of an electrolyte, ionic species precipitate to form salt. ν_(C) C ^(Z) ^(C) +ν_(A) A ^(Z) ^(A) →C _(ν) _(C) A _(ν) _(A) _((S))  (83) The corresponding solubility product constant can be defined as follows. K _(sp) =x _(C) ^(ν) ^(C) γ*_(C) ^(ν) ^(C) x _(A) ^(ν) ^(A) γ*_(A) ^(ν) ^(A)   (84) Eqs. 83 and 84 can be expanded to include solvent molecules and other species if the solid polymorph involves hydrates, other solvent-containing salts, double salts, triple salts, and others.

Applicants tested the applicability of eNRTL-SAC with the very limited public literature data and some proprietary data on solubilities of a number of inorganic and organic electrolytes in various solvents. This description presents the results with solubility data from public literature. To bring certain consistency to the data treatment, Applicants convert all solute solubility data to mole fraction (except for sodium chloride and sodium acetate). Applicants also assign standard deviation of 10% to all solute solubility data within range of 1 to 0.1, standard deviation of 20% to all solute solubility data with range of 0.1 to 0.01, standard deviation of 30% to data with range of 0.01 to 0.001, and so on.

Solubility data of sodium chloride in twelve different solvents at 298.15 K were successfully fitted with the eNRTL-SAC model. (Note that the temperature for the acetone data is 291.15 K and the temperature for the ethyl acetate data is 292.15 K. However, they are included as if they were data at 298.15 K.) The sodium chloride solubilities in the twelve solvents vary by six orders of magnitude. The satisfactory fit of the data for ten solvents (formic acid and ethyl acetate excluded) is shown in FIG. 28. The eNRTL-SAC model predicts one order-of-magnitude higher solubility for sodium chloride in formic acid and virtually no solubility for sodium chloride in ethyl acetate while the data suggests very low but measurable solubility. The molecular parameters and the solubility product constant were adjusted simultaneously to provide the best fit to the data and the identified values are given in Table 8. In Table 8, the last column to the right quantifies goodness of fit to data. It is worth noting that the electrolyte parameter E for sodium chloride is near unity, similar to the parameters reported in Table 3 for sodium chloride.

Solubility data of sodium acetate in five different solvents was also fitted successfully with the eNRTL-SAC model. The solubilities in the five solvents vary by four orders of magnitude. The fit of the data is shown in FIG. 29. The solid phase for the solubility measurements is anhydrous sodium acetate. Note that the data for methanol and acetone was taken at 291.15 K while the data for water and ethylene glycol was taken at 298.15 K. The temperature for the 1-propanol data is not known. In fitting the data, Applicants treated all data as if it was 298.15 K data. The identified molecular parameters and the solubility product constant are given in Table 8. As an organic electrolyte, the electrolyte parameter E for sodium acetate is found to be significantly less than unity.

FIGS. 30 a and 30 b show satisfactory representations of the solubility data of benzoic acid in twenty-six solvents (Beerbower, A. et al., “Expanded Solubility Parameter Approach. I. Naphthalene and Benzoic Acid in Individual Solvents,” J. Pharm. Sci., 73:179, 1984) and the solubility data of sodium benzoate in ten solvents (Bustamante, P. et al., “The Modified Extended Hansen Method to Determine Partial Solubility Parameters of Drugs Containing a Single Hydrogen Bonding Group and Their Sodium Derivatives: Benzoic Acid/Na and Ibuprofen/Na,” Int. J. of Pharmaceutics, 194:117, 2000). These solvents are chosen in this study because of the availability of the NRTL-SAC parameters for the solvents from Applicants' prior work. The identified molecular parameters for the two solutes were given in Table 8. It is interesting that the molecular parameters identified for benzoic acid with twenty-six solvents in this study are quite similar to the molecular parameters identified for benzoic acid with seven solvents in Applicants' earlier study. Applicants also noted that the solubility range expands as benzoic acid is converted to sodium benzoate. Furthermore, the molecular parameters have changed from a hydrophobic/polar/hydrophilic combination (benzoic acid) to a polar/hydrophilic/electrolytic combination (sodium benzoate). Solubility data of sodium benzoate in seven other solvents (chloroform, benzene, dioxane, cyclohexane, ethyl acetate, heptane and chlorobenzene) is excluded from FIG. 29 b because the eNRTL-SAC model predicts virtually no solubility for sodium benzoate in these solvents while the data suggests very low but measurable solubility. It is probable that the molecular form of sodium benzoate may be present in such highly hydrophobic solvents. However, due to their low concentrations, Applicants chose to ignore these low solubility solvents in this study although the current thermodynamic framework can be used to account for the two solubility routes, i.e., Eqs. 82 and 84, individually or simultaneously.

FIGS. 31 a and 31 b show successful representations of the solubility data of salicylic acid in eighteen solvents and the solubility data of sodium salicylate in thirteen solvents (Barra, J. et al., “Proposition of Group Molar Constants for Sodium to Calculate the Partial Solubility Parameters of Sodium Salts Using the van Krevelen Group Contribution Method,” Eur. J. of Pharm. Sci., 10:153, 2000). Their molecular parameters were given in Table 8. Like the molecular parameters for benzoic acid and the sodium salt, the molecular parameters have changed from a hydrophobic/polar/hydrophilic combination (salicylic acid) to a polar/hydrophilic/electrolytic combination (sodium salicylate). Solubility data of sodium salicylate in benzene, cyclohexane, and heptane is excluded from FIG. 31 b, again because the eNRTL-SAC model predicts virtually no solubility of sodium salicylate in these three solvents although the data suggests very low but measurable solubility. Acetic acid is the only outlier among solvents with significant solubility for sodium salicylate. The eNRTL-SAC model prediction for the solubility of sodium salicylate in acetic acid is about one order of magnitude too high. Acetic acid is not included in the thirteen solvents shown in FIG. 31 b.

The eNRTL-SAC model results for the solubility data of p-aminobenzoic acid in nineteen solvents and sodium p-aminobenzoate in twelve solvents (Barra et al., 2000, above) are given in FIGS. 32 a and 32 b. Again, low solubility solvents (benzene, cyclohexane and heptane) are excluded from FIG. 32 b for sodium aminobenzoate. Acetone and DMF are two outliers for sodium aminobenzoate and they are also excluded from FIG. 32 b. The eNRTL-SAC model predicts two orders of magnitude higher solubilities in these two solvents.

The solubility data and model calculations for ibuprofen in nineteen solvents and sodium ibuprofen in eleven solvents (Bustamante et al., 2000 above) are given in FIGS. 33 a and 33 b. In comparison to other organic solutes, one embodiment of the eNRTL-SAC model provides a rather poor fit to the ibuprofen data albeit a better fit than in prior nonelectrolyte models. Applicants did notice that the ibuprofen solubility data from Bustamante et al. are significantly different from those reported by Gracin and Rasmuson (Gracin, S. and A. C. Rasmuson, “Solubility of Phenylacetic Acid, p-Hydroxyphenylacetic Acid, p-Aminophenylacetic acid, p-Hydroxybenzoic acid, and Ibuprofen in Pure Solvents,” J. Chem. Eng. Data, 47:1379, 2002) for certain common solvents including methanol, ethanol, acetone and ethyl acetate. No attempt was made to reconcile the differences between the Bustamante data and the Gracin and Rasmuson data. The eNRTL-SAC model fit to the sodium ibuprofen solubility data appears to be more satisfactory. Again, the eleven solvents reported in FIG. 33 b do not include low solubility solvents (benzene, cyclohexane, heptane, and chlorobenzene). Similarly, acetone and DMF are two outliers for sodium ibuprofen and they are also excluded from FIG. 33 b. The eNRTL-SAC model predicts two orders of magnitude higher solubilities in these two solvents than the available data. Bustamante et al. (2000, above) reported high water content of the ibuprofen sample (3.3 wt % water) and the sodium ibuprofen sample (13 wt % water). It is not clear how such high water contents in the samples could impact on the solubility measurements.

The solubility data for diclofenac in sixteen solvents and sodium diclofenac in ten solvents (Barra et al., 2000 above) are fitted and reported in FIGS. 34 a and 34 b. The eNRTL-SAC model significantly overestimates the solubilities of diclofenac in acetic acid, formamide and ethylene glycol. These three solvents are excluded from the sixteen solvents shown in FIG. 34 a. Data for low solubility solvents (benzene, cyclohexane, ethyl acetate, heptane and chlorobenzene) for sodium diclofenac are excluded from FIG. 34 b. Acetic acid and acetone are two outliers with the model estimations one to three orders of magnitude higher solubilities for sodium diclofenac. The two solvents are not included in FIG. 34 b.

The solubility data treatment above assumes complete dissociation of electrolytes and considers the solubility problem as formation of salts from ionized species of electrolytes, i.e., Eq. 84. One may argue that electrolytes do not dissociate completely into ionic species especially in organic solvents of low dielectric constant. In the absence of dissociation to ionic species, the solubility relationship can be described by Eq. 82 and the eNRTL-SAC model of the present invention reduces to the NRTL-SAC model of the parent patent application. Applicants have treated the electrolyte systems above as nonelectrolytes (i.e., no dissociation to ionic species) with NRTL-SAC and the model results are also included in Table 8. With the absence of electrolyte parameter, the representation of the solubility data deteriorates substantially. Applicants also noted that the identified molecular parameters (X, Y−, Y+, and Z) with the complete dissociation treatment are roughly twice as large as those reported with the non-dissociation treatment. This finding is consistent with the fact that Applicants only assign the molecular parameters (X Y−, Y+, and Z) to the anion.

TABLE 5 Dielectric Constant of Solvents at 298.15 K dielectric constant solvent name at 298.15 K Acetic acid 6.13 Acetone 20.83 Acetonitrile 36.97 Anisole 4.3 Benzene 2.27 1-Butanol 17.7 2-Butanol 15.8 n-Butyl-acetate 5.1 Methyl-tert-butyl-ether 2.6 Carbon-tetrachloride 2.23 Chlorobenzene 5.56 Chloroform 4.7 Cumene 2.22 Cyclohexane 2.02 1,2-Dichloroethane 10.19 1,1-Dichloroethylene 4.6 1,2-Dichloroethylene 4.6 Dichloromethane 8.9 1,2-Dimethoxyethane not available N,N-Dimethylacetamide not available N,N-Dimethylformamide 38.3 Dimethyl-sulfoxide 47.2 1,4-Dioxane 2.21 Ethanol 24.11 2-Ethoxyethanol not available Ethyl-acetate 6.02 Ethylene-glycol 41.2 Diethyl-ether 4.26 Ethyl-formate 7.16 Formamide 109.5 Formic-acid 58.5 n-Heptane 1.92 n-Hexane 1.89 Isobutyl-acetate 5.6 Isopropyl-acetate not available Methanol 32.62 2-Methoxyethanol not available Methyl-acetate 6.68 3-Methyl-1-butanol 14.7 2-Hexanone 14.6 Methylcyclohexane 2.02 Methyl-ethyl-ketone 18.5 Methyl-isobutyl-ketone 13.1 Isobutanol 17.9 N-Methyl-2-pyrrolidone 33 Nitromethane 6.26 n-Pentane 1.84 1-Pentanol 13.9 1-Propanol 20.1 Isopropyl-alcohol 19.9 n-propyl-acetate 6 Pyridine 2.3 Sulfolane 43.3 Tetrahydrofuran 7.52 1,2,3,4-Tetrahydronaphthalene not available Toluene 2.36 1,1,1-Trichloroethane 7.5 Trichloroethylene 3.42 m-Xylene 2.24 Water 78.54 Triethylamine 2.44 1-Octanol 10.3

TABLE 6 NRTL Binary Interaction Parameters Segment (1) x x y− y+ x x y− y+ z Segment (2) y− z z z y+ e e e e τ₁₂ 1.643 6.547 −2.000 2.000 1.643 15 12 12 8.885 τ₂₁ 1.834 10.949 1.787 1.787 1.834 5 −3 −3 −4.549 α₁₂ = α₂₁ 0.2 0.2 0.3 0.3 0.2 0.2 0.2 0.2 0.2

TABLE 7 Results of Fit for Molality Scale Mean Ionic Activity Coefficient Data of Aqueous Electrolytes at 298.15 K (Data from Robinson and Stokes, 1970) max. E Y− Y+ σ^(□) molality 1-1 Electrolytes AgNO₃ 0.738 1.758 0.050 6.0 CsAc 1.002 0.438 0.011 3.5 CsBr 0.950 0.678 0.013 5.0 CsCl 0.948 0.643 0.014 6.0 CsI 0.956 0.719 0.012 3.0 CsNO₃ 0.981 1.328 0.005 1.4 CsOH 0.942 0.354 0.002 1.0 HBr 1.135 0.654 0.034 3.0 HCl 1.324 0.524 0.087 6.0 HClO₄ 1.476 0.569 0.136 6.0 HI 1.117 0.824 0.035 3.0 HNO₃ 0.971 0.211 0.005 3.0 KAc 0.998 0.386 0.009 3.5 KBr 0.910 0.311 0.011 5.5 KBrO₃ 0.968 1.141 0.002 0.5 KCl 0.920 0.370 0.010 4.5 KClO₃ 0.958 1.053 0.003 0.7 KCNS 0.876 0.477 0.019 5.0 KF 0.987 0.042 0.004 4.0 KH Malonate 0.846 0.920 0.022 5.0 KH Succinate 0.912 0.665 0.011 4.5 KH₂PO₄ 0.970 1.362 0.006 1.8 KI 0.903 0.168 0.011 4.5 KNO₃ 0.856 1.461 0.027 3.5 KOH 1.236 0.344 0.058 6.0 K Tol 0.750 1.296 0.026 3.5 LiAc 0.962 0.097 0.002 4.0 LiBr 1.422 0.526 0.116 6.0 LiCl 1.282 0.436 0.084 6.0 LiClO₄ 1.145 0.681 0.047 4.0 LiI 1.058 0.712 0.033 3.0 LiNO₃ 1.050 0.294 0.022 6.0 LiOH 1.028 0.652 0.022 4.0 LiTol 0.881 0.392 0.014 4.5 NaAc 0.978 0.301 0.005 3.5 NaBr 0.992 0.115 0.008 4.0 NaBrO₃ 0.923 0.802 0.010 2.5 Na Butyrate 0.989 0.566 0.009 3.5 NaCl 1.000 0.017 6.0 NaClO₃ 0.891 0.507 0.011 3.5 NaClO₄ 0.894 0.267 0.010 6.0 NaCNS 0.925 0.128 0.006 4.0 NaF 0.976 0.425 0.002 1.0 Na Formate 0.905 0.094 0.013 3.5 NaH Malonate 0.878 0.664 0.019 5.0 NaH Succinate 0.924 0.495 0.010 5.0 NaH₂PO₄ 0.864 1.256 0.020 6.0 NaI 1.009 0.266 0.012 3.5 NaNO₃ 0.825 0.842 0.029 6.0 NaOH 1.080 0.109 0.039 6.0 Na Propionate 0.992 0.448 0.006 3.0 Na Tol 0.793 0.920 0.026 4.0 NH₄Cl 0.884 0.424 0.019 6.0 NH₄NO₃ 0.813 1.128 0.043 6.0 RbAc 1.012 0.416 0.011 3.5 RbBr 0.914 0.519 0.016 5.0 RbCl 0.929 0.466 0.012 5.0 RbI 0.925 0.520 0.014 5.0 RbNO₃ 0.815 1.611 0.038 4.5 TlAc 0.864 0.952 0.033 6.0 TlClO₄ 1.020 1.231 0.000 0.5 TlNO3 1.069 1.692 0.003 0.4 1-2 Electrolytes Cs₂SO₄ 1.161 2.568 0.050 1.8 K₂CrO₄ 1.048 2.738 0.075 3.5 K₂SO₄ 1.386 2.475 0.021 0.7 Li₂SO₄ 1.138 2.177 0.051 3.0 Na₂CrO₄ 1.091 2.443 0.051 4.0 Na₂ Fumarate 1.259 1.770 0.041 2.0 Na₂ Maleate 1.202 2.699 0.075 3.0 Na₂SO₄ 0.988 3.273 0.090 4.0 Na₂S₂O₃ 1.071 2.709 0.064 3.5 (NH₄)₂SO₄ 1.006 3.477 0.118 4.0 Rb₂SO₄ 1.150 2.743 0.052 1.8 1-3 Electrolytes K₃Fe(CN)₆ 1.328 4.996 0.101 1.4 1-4 Electrolytes K₄Fe(CN)₆ 1.449 9.448 0.146 0.9 2-1 Electrolytes BaAc₂ 1.016 0.997 0.128 3.5 BaBr₂ 1.267 0.358 0.018 2.0 BaCl₂ 1.227 0.585 0.029 1.8 Ba(ClO₄)₂ 1.305 0.261 0.049 5.0 BaI₂ 1.354 0.028 0.017 2.0 Ba(NO₃)₂ 1.435 1.268 0.008 0.4 CaBr₂ 1.969 0.171 0.495 6.0 CaCl₂ 1.701 0.309 0.283 6.0 Ca(ClO₄)₂ 2.021 0.431 6.0 CaI₂ 1.419 0.131 0.036 2.0 Ca(NO₃)₂ 1.108 0.875 0.053 6.0 CdBr₂ 1.324 3.164 0.294 4.0 CdCl₂ 1.052 3.047 0.315 6.0 CdI₂ 1.780 3.820 0.337 2.5 Cd(NO₃)₂ 1.176 0.500 0.037 2.5 CoBr₂ 1.779 0.218 5.0 CoCl₂ 1.397 0.194 0.046 4.0 CoI₂ 2.260 0.488 6.0 Co(NO₃)₂ 1.444 0.296 0.113 5.0 CuCl₂ 1.033 0.425 1.217 0.069 6.0 Cu(NO₃)₂ 1.409 0.416 0.117 6.0 FeCl₂ 1.319 0.255 0.011 2.0 MgAc₂ 1.192 0.946 0.059 4.0 MgBr₂ 1.941 0.347 5.0 MgCl₂ 1.745 0.144 0.275 5.0 Mg(ClO₄)₂ 1.988 0.162 0.303 4.0 MgI₂ 2.237 0.470 5.0 Mg(NO₃)₂ 1.493 0.198 0.140 5.0 MnCl₂ 1.273 0.343 0.020 6.0 NiCl₂ 1.533 0.189 0.123 5.0 Pb(ClO₄)₂ 1.549 0.236 0.184 6.0 Pb(NO₃)₂ 1.129 1.964 0.083 2.0 SrBr₂ 1.330 0.183 0.023 2.0 SrCl₂ 1.401 0.357 0.082 4.0 Sr(ClO₄)₂ 1.742 0.034 0.261 6.0 SrI₂ 1.384 0.076 0.030 2.0 Sr(NO₃)₂ 0.978 1.250 0.091 4.0 UO₂Cl₂ 1.277 0.024 0.017 3.0 UO₂(ClO₄)₂ 2.854 0.883 5.5 UO₂(NO₃)₂ 1.392 0.372 0.490 0.036 5.5 ZnBr₂ 0.906 0.337 0.088 6.0 ZnCl₂ 0.953 0.971 0.065 6.0 Zn(ClO₄)₂ 2.045 0.130 0.318 4.0 ZnI₂ 0.868 0.132 0.116 6.0 Zn(NO₃)₂ 1.518 0.214 0.176 6.0 2-2 Electrolytes BeSO₄ 1.376 4.077 0.233 4.0 MgSO₄ 1.380 4.206 0.238 3.0 MnSO₄ 1.287 4.460 0.271 4.0 NiSO₄ 1.398 4.381 0.220 2.5 CuSO₄ 1.587 4.114 0.154 1.4 ZnSO₄ 1.339 4.417 0.242 3.5 CdSO₄ 1.295 4.547 0.271 3.5 UO₂SO₄ 1.215 4.528 0.309 6.0 3-1 Electrolytes AlCl₃ 1.730 0.579 0.087 1.8 CeCl₃ 1.562 0.883 0.047 1.8 CrCl₃ 1.589 0.641 0.022 1.2 Cr(NO₃)₃ 1.551 0.761 0.036 1.4 EuCl₃ 1.586 0.820 0.049 2.0 LaCl₃ 1.553 0.877 0.042 2.0 NdCl₃ 1.575 0.882 0.045 2.0 PrCl₃ 1.562 0.892 0.042 2.0 ScCl₃ 1.636 0.709 0.041 1.8 SmCl₃ 1.581 0.843 0.046 2.0 YCl₃ 1.629 0.807 0.057 2.0 3-2 Electrolytes Al₂(SO₄)₃ 1.354 4.886 0.222 1.0 Cr₂(SO₄)₃ 1.257 4.549 0.218 1.2 4-1 Electrolytes Th(NO₃)₄ 1.273 1.251 0.056 5.0 ^(1.) σ is defined to be $\left\lbrack {\sum\limits_{i}^{N}\;{\left( \frac{\gamma_{\pm i}^{*\exp} - \gamma_{\pm i}^{*{cal}}}{\gamma_{\pm i}^{*\exp}} \right)^{2}\text{/}N}} \right\rbrack^{1/2}$ where γ_(±)* is the mean ionic activity coefficient of electrolyte and N is the number of data used in correlations

TABLE 8 eNRTL-SAC Model Parameters for Solutes solute no. of solvents X Y− Y+ Z E ln K_(sp) σ⁴ benzoic acid¹ 26 0.494 0.336 0.468 −1.714 0.292 salcylic acid¹ 18 0.726 0.176 0.749 −1.624 0.774 p-aminobenzoic acid¹ 19 0.552 0.423 0.594 0.881 −3.348 1.206 Ibuprofen¹ 19 1.038 0.051 0.028 0.318 −1.423 1.055 Diclofenac¹ 16 0.158 1.678 0.451 −3.560 0.991 sodium chloride² 10 1.444 0.994 −6.252 0.783 sodium acetate² 5 1.417 0.521 −6.355 0.241 sodium benzoate² 10 0.750 1.685 2.201 0.539 −7.312 0.493 sodium salicylate² 13 0.845 2.417 0.090 −4.889 0.771 sodium p-aminobenzoate² 12 2.299 2.387 0.192 −8.293 1.258 sodium ibuprofen² 11 1.819 1.743 2.362 0.150 −17.844 0.886 sodium diclofenac² 10 0.409 3.558 3.486 0.161 −14.202 0.858 sodium chloride³ 10 1.060 2.200 −3.540 0.923 sodium acetate³ 5 0.249 0.679 −2.277 0.281 sodium benzoate³ 10 0.179 1.825 −2.978 0.699 sodium salicylate³ 13 0.373 1.572 −2.153 1.058 sodium p-aminobenzoate³ 12 0.125 0.649 1.895 −3.247 1.904 sodium ibuprofen³ 11 0.270 0.394 0.823 −2.364 1.685 sodium diclofenac³ 10 0.454 0.124 2.493 −4.405 1.473 ¹nonelectrolytes ²electrolytes ³treated as nonelectrolytes ⁴σ is defined to be $\left( {\sum\limits_{i}^{N}\;{\left( {{\ln\mspace{11mu} x_{i}^{\exp}} - {\ln\mspace{11mu} x_{i}^{cal}}} \right)^{2}\text{/}N}} \right)^{1/2}\;$ where x is the solubility of solute, i.e., mole fraction (note that mass fraction for sodium chloride and sodium acetate), and N is the number of data used in correlations.

While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims. 

What is claimed is:
 1. A method of conducting a pharmaceutical activity, the method comprising modeling at least one physical property of a mixture of at least two chemical species by the computer implemented steps of: a) providing a modeler configured to be executable by a processor, the modeler during execution being formed of (i) a databank of molecular descriptors of known chemical species, and (ii) a calculator of molecular descriptors of unknown chemical species; b) determining at least one conceptual segment, instead of a molecular structural segment, for each of the at least two chemical species, the conceptual segment being determined from in-mixture behavior of the at least two chemical species, including for each conceptual segment, (i) identifying the conceptual segment as one of a hydrophobic segment, a hydrophilic segment, a polar segment, or a combination thereof, and (ii) defining an equivalent number for the conceptual segment, the equivalent number being based on experimental phase equilibrium data and being one of carried in the databank of molecular descriptors of known chemical species or obtained using the calculator of molecular descriptors of unknown chemical species by regression of experimental phase equilibrium data for binary systems of unknown chemical species and reference chemical species; c) providing the determined at least one conceptual segment to the modeler, and in response the modeler using the determined at least one conceptual segment to compute at least one physical property of the mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof, the modeler computing the at least one physical property by determining an activity coefficient of one of the at least two chemical species, the activity coefficient being formed of at least a residual contribution to the activity coefficient of the one chemical species, the modeler setting the residual contribution equal to a local composition interaction contribution to the activity coefficient for the one chemical species based on the determined at least one conceptual segment; d) analyzing the computed at least one physical property using the modeler, in a comparison to the computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in conducting the pharmaceutical activity; and e) outputting the formed model from the modeler to a computer display monitor in a manner enabling the pharmaceutical activity.
 2. The method of claim 1, wherein the pharmaceutical activity includes at least one of: a) drug discovery, development or manufacture; b) drug design, drug synthesis, drug characterization, drug screen and assay, clinical evaluation, or drug purification; and c) a study on a chemical interaction within the mixture.
 3. The method of claim 2, wherein the study includes one or more of pharmacokinetics, pharmacodynamics, solvent screening, combination drug therapy, drug toxicity, a process design for an active pharmaceutical ingredient, chromatography, drug formulation, and drug synthesis.
 4. The method of claim 2, wherein the drug synthesis includes distillation, screening, crystallization, filtration, washing, and drying.
 5. The method of claim 1, wherein the mixture includes any number and combination of vapor, solid and liquid phases.
 6. The method of claim 1, wherein the mixture includes at least one liquid solvent and at least one pharmaceutical component.
 7. The method of claim 6, wherein the mixture includes more than one phase and at least a portion of the at least one pharmaceutical component is in a liquid phase.
 8. The method of claim 1, wherein one of the at least two chemical species is a pharmaceutical component, and the pharmaceutical component is an active pharmaceutical ingredient.
 9. The method of claim 1, wherein at least one of the one or more species is in an amorphous phase and is an active pharmaceutical ingredient.
 10. The method of claim 9, further including a step of estimating an amorphous phase solubility by calculating a phase equilibrium between a solute rich phase and a solvent rich phase.
 11. The method of claim 1, wherein the mixture includes a solid phase, a liquid phase, and a first chemical species, wherein at least a portion of the first chemical species is in the solid phase, and the step of computing at least one physical property includes calculating: ${{\ln\; x_{I}^{SAT}} = {{\frac{\Delta_{fus}S}{R}\left( {1 - \frac{T_{m}}{T}} \right)} - {\ln\;\gamma_{I}^{SAT}}}},$ wherein T is the temperature of the mixture, T_(m) is the melting temperature of the solid phase compound, T is less than or about equal to T_(m), x_(I) ^(SAT) is the mole fraction of the first chemical species dissolved in the liquid phase at saturation, Δ_(fus)S is the entropy of fusion of the first chemical species, γ_(I) ^(SAT) is the activity coefficient for the first chemical species in the liquid phase at saturation, and R is the gas constant.
 12. A method of claim 11, wherein the step of computing at least one physical property further includes determining γ_(I), wherein ln γ_(I)=ln γ_(I) ^(C)+ln γ_(I) ^(R), γ_(I) is an activity coefficient for a component of the mixture, γ_(I) ^(C) is a combinatorial contribution to the activity coefficient for the component of the mixture, and γ_(I) ^(R) is a residual contribution to the activity coefficient of the component.
 13. A method of claim 12, wherein the step of computing at least one physical property further includes computing $\mspace{79mu}{{{\ln\;\gamma_{I}^{R}} = {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{m}{r_{m,I}\left\lfloor {{\ln\;\gamma_{m}^{lc}} - {\ln\;\gamma_{m}^{{lc},I}}} \right\rfloor}}}},{wherein}}$ $\mspace{79mu}{{{\ln\;\gamma_{m}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime}}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}} \right)}}}};}$ ${{\ln\;\gamma_{m}^{{lc},I}} = {\frac{\sum\limits_{j}{x_{j,I}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k,I}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime},I}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}} \right)}}}};$ $\mspace{79mu}{{x_{j} = \frac{\sum\limits_{J}{x_{J}r_{j,J}}}{\sum\limits_{I}{\sum\limits_{i}{x_{I}r_{i,I}}}}};{x_{j,I} = \frac{r_{j,I}}{\sum\limits_{j}r_{j,I}}};}$ i, j, k, m, and m′ are segment species; I and J are components; x_(j) is a segment mole fraction of segment species j; x_(J) is a mole fraction of component J; r_(m,I) is the number of segment species m contained in component I, γ_(m) ^(lc) is an activity coefficient of segment species m, and γ_(m) ^(lc,I) is an activity coefficient of segment species m contained only in component I; G and τ are local binary quantities related to each other by a non-random factor parameter α; and G=exp(−ατ).
 14. The method of claim 1, wherein the at least two chemical species includes at least one electrolyte.
 15. The method of claim 14, further including the steps of using a determined conceptual electrolyte segment and computing at least one physical property of the mixture.
 16. The method of claim 14, wherein the electrolyte is any one of a pharmaceutical compound, a nonpolymeric compound, a polymer, an oligomer, an inorganic compound and an organic compound.
 17. The method of claim 14, wherein the electrolyte is symmetrical or unsymmetrical.
 18. The method of claim 14, wherein the electrolyte is univalent or multivalent.
 19. The method of claim 14, wherein the electrolyte includes two or more ionic species.
 20. The method of claim 15, wherein the conceptual electrolyte segment includes a cationic segment and an anionic segment, both segments of unity of charge.
 21. The method of claim 15, wherein the step of computing at least one physical property includes calculating the activity coefficient of the ionic species derived from the electrolyte.
 22. The method of claim 21, wherein the computed physical property of the analysis includes at least one of activity coefficient, vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, and lipophilicity of the electrolyte.
 23. The method of claim 22, wherein the step of computing the solubility of the electrolyte includes calculating: ${{K_{sp}(T)} = {\prod\limits_{C}{x_{C}^{v_{C},{SAT}}\gamma_{C}^{{*v_{C}},{SAT}}{\prod\limits_{A}{x_{A}^{v_{A},{SAT}}\gamma_{A}^{{*v_{A}},{SAT}}{\prod\limits_{M}{x_{M}^{SAT}\gamma_{M}^{SAT}}}}}}}},$ wherein: K_(sp) is the solubility product constant for the electrolyte, T is the temperature of the mixture, x_(C) ^(ν) ^(C) ^(SAT) is the mole fraction of a cation derived from the electrolyte at saturation point of the electrolyte, x_(A) ^(ν) ^(A) ^(SAT) is the mole fraction of an anion derived from the electrolyte at saturation point of the electrolyte, x_(M) ^(ν) ^(M) ^(SAT) is the mole fraction of a neutral molecule derived from the electrolyte at saturation point of the electrolyte, γ*_(C) ^(ν) ^(C) ^(,SAT) is the activity coefficient of a cation derived from the electrolyte at the saturation concentration; γ*_(A) ^(ν) ^(A) ^(,SAT) is the activity coefficient of an anion derived from the electrolyte at the saturation concentration; γ*_(M) ^(ν) ^(M) ^(,SAT) is the activity coefficient of a neutral molecule derived from the electrolyte at the saturation concentration; C is the cation, A is the anion, M is one or more solvent or solute molecule, T is the temperature of the mixture, γ* is the unsymmetric activity coefficient of a species in solution, SAT is saturation concentration, υ_(C) is the cationic stoichiometric coefficient, υ_(A) is the anionic stoichiometric coefficient, and υ_(M) is the neutral molecule stoichiometric coefficient.
 24. The method of claim 23, wherein the solvent is water, and the step of computing at least one physical property includes calculating: ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)+ln γ*_(I) ^(FH), wherein: I is the ionic specie; ln γ*_(I); is the logarithm of an activity coefficient of I; ln γ*_(I) ^(lc) is the local composition term of I; ln γ*_(I) ^(PDH) is the Pitzer-Debye-Hückel term of I; and ln γ*_(I) ^(FH) is the Flory-Huggins term of I.
 25. The method of claim 23, wherein the one or more solvents include mixed-solvent solutions, and the step of computing at least one physical property including calculating: ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)+ln γ*_(I) ^(FH)+Δ ln γ_(I) ^(Born), wherein: I is the ionic specie; ln γ*_(I) is the logarithm of an activity coefficient of I; ln γ*_(I) ^(lc) is the local interaction contribution of I; ln γ*_(I) ^(PDH) is the Pitzer-Debye-Hückel term of I; ln γ*_(I) ^(FH) is the Flory-Huggins term of I; and Δ ln γ_(I) ^(Born) is the Born term of I.
 26. The method of claim 14, wherein if the mixture includes a single electrolyte, the step of defining the segment number includes calculating: r _(c,C) =r _(e,CA) Z _(C) and r _(a,A) =r _(e,CA) Z _(A), wherein: r_(e) is the electrolyte segment number, r_(c) is the cationic segment number, r_(a) is the anionic segment number, where r_(c) and r_(a) satisfy electroneutrality; CA is an electrolyte, wherein C is a cation, and A is an anion; and Z_(C) is the charge number for the cation C, and Z_(A) is the charge number for the anion A; and if the mixture includes multiple electrolytes, the step of defining the segment number includes calculating: $r_{c,C} = {\sum\limits_{A}{r_{e,{CA}}{Z_{C}\left( {x_{A}{Z_{A}/{\sum\limits_{A^{\prime}}{x_{A^{\prime}}Z_{A^{\prime}}}}}} \right)}\mspace{14mu}{and}}}$ ${r_{a,A} = {\sum\limits_{C}{r_{e,{CA}}{Z_{A}\left( {x_{C}{Z_{C}/{\sum\limits_{C^{\prime}}{x_{C^{\prime}}Z_{C^{\prime}}}}}} \right)}}}},$ wherein: r_(e) is the segment number, r_(c) is the cationic segment number, r_(a) is the anionic segment number, where r_(c) and r_(a) satisfy electroneutrality; CA is an electrolyte, wherein C is a cation, and A is an anion; C′A′ is other electrolyte(s), wherein C′ is a cation and A′ is an anion; Z_(C) is a charge number for C, and Z_(A) is a charge number for A; Z_(C′) is a charge number for C′, and Z_(A′) is a charge number for A′; x_(A) is a mole fraction of A, and x_(C) is a mole fraction of C; and x_(A′) is a mole fraction of A′, and x_(C′) is a mole fraction of C′.
 27. A method of separating one or more chemical species from a mixture, the method comprising modeling at least one physical property of a mixture of at least two chemical species by the computer implemented steps of: a) providing a modeler configured to be executable by a processor, the modeler during execution being formed of (i) a databank of molecular descriptors of known chemical species, and (ii) a calculator of molecular descriptors of unknown chemical species; b) determining at least one conceptual segment, instead of a molecular structural segment, for each of the at least two chemical species, the conceptual segment being determined from in-mixture behavior of the at least two chemical species, including for each conceptual segment, (i) identifying the conceptual segment as one of a hydrophobic segment, a hydrophilic segment, a polar segment, or a combination thereof, and (ii) defining an equivalent number for the conceptual segment, the equivalent number being based on experimental phase equilibrium data and being one of carried in the databank of molecular descriptors of known chemical species or obtained using the calculator of molecular descriptors of unknown chemical species by regression of experimental phase equilibrium data for binary systems of unknown chemical species and reference chemical species; c) providing the determined at least one conceptual segment to the modeler, and in response the modeler using the determined at least one conceptual segment to compute at least one physical property of the mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof, the modeler computing the at least one physical property by determining an activity coefficient of one of the at least two chemical species, the activity coefficient being formed of at least a residual contribution to the activity coefficient of the one chemical species, the modeler setting the residual contribution equal to a local composition interaction contribution to the activity coefficient for the one chemical species based on the determined at least one conceptual segment; d) analyzing the computed at least one physical property using the modeler, in a comparison to the computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in separating one or more chemical species from the mixture; and e) outputting the formed model from the modeler to a computer display monitor in a manner enabling separating one or more chemical species from the mixture.
 28. The method of claim 27, wherein separation of one or more species from a mixture uses chromatography.
 29. The method of claim 28, wherein the chromatography includes capillary-action chromatography, paper chromatography, thin layer chromatography, column chromatography, fast protein liquid chromatography, high performance liquid chromatography, ion exchange chromatography, affinity chromatography, gas-liquid chromatography, high performance liquid chromatography, and countercurrent chromatography.
 30. The method of claim 27, wherein the mixture includes at least one liquid phase.
 31. The method of claim 27, wherein the at least one chemical species is an active pharmaceutical ingredient.
 32. The method of claim 27, wherein the step of computing at least one physical property further includes determining γ_(I), wherein ln γ_(I)=ln γ_(I) ^(C)+ln γ_(I) ^(R), γ_(I) is an activity coefficient for a component of the mixture, γ_(I) ^(C) is a combinatorial contribution to the activity coefficient for the component of the mixture, and γ_(I) ^(R) is a residual contribution to the activity coefficient of the component.
 33. The method of claim 32, wherein the step of computing at least one physical property further includes computing $\mspace{79mu}{{{\ln\;\gamma_{I}^{R}} = {{\ln\;\gamma_{I}^{lc}} = {\sum\limits_{m}{r_{m,I}\left\lfloor {{\ln\;\gamma_{m}^{lc}} - {\ln\;\gamma_{m}^{{lc},I}}} \right\rfloor}}}},{wherein}}$ $\mspace{79mu}{{{\ln\;\gamma_{m}^{lc}} = {\frac{\sum\limits_{j}{x_{j}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime}}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k}G_{{km}^{\prime}}}}} \right)}}}};}$ ${{\ln\;\gamma_{m}^{{lc},I}} = {\frac{\sum\limits_{j}{x_{j,I}G_{jm}\tau_{jm}}}{\sum\limits_{k}{x_{k,I}G_{km}}} + {\sum\limits_{m^{\prime}}{\frac{x_{m^{\prime},I}G_{{mm}^{\prime}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}\left( {\tau_{{mm}^{\prime}} - \frac{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}\tau_{{km}^{\prime}}}}{\sum\limits_{k}{x_{k,I}G_{{km}^{\prime}}}}} \right)}}}};$ $\mspace{79mu}{{x_{j} = \frac{\sum\limits_{J}{x_{J}r_{j,J}}}{\sum\limits_{I}{\sum\limits_{i}{x_{I}r_{i,I}}}}};{x_{j,I} = \frac{r_{j,I}}{\sum\limits_{j}r_{j,I}}};}$ i, j, k, m, and m′ are segment species; I and J are components; x_(j) is a segment mole fraction of segment species j; x_(J) is a mole fraction of component J; R_(m,I) is the number of segment species m contained in component I, γ_(m) ^(lc); is an activity coefficient of segment species m, and γ_(m) ^(lc,I) is an activity coefficient of segment species m contained only in component I; G and τ are local binary quantities related to each other by a non-random factor parameter α; and G=exp(−ατ).
 34. The method of claim 27, wherein the at least two chemical species includes at least one electrolyte.
 35. The method of claim 34, further including the steps of using a determined conceptual electrolyte segment and computing at least one physical property of the mixture.
 36. The method of claim 35, wherein the conceptual electrolyte segment includes a cationic segment and an anionic segment, both segments of unity of charge.
 37. The method of claim 35, wherein the step of computing at least one physical property includes calculating the activity coefficient of the ionic species derived from the electrolyte.
 38. The method of claim 37, wherein the computed physical property of the analysis includes at least one of activity coefficient, vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, and lipophilicity of the electrolyte.
 39. The method of claim 38, wherein the step of computing the solubility of the electrolyte includes calculating: ${{K_{sp}(T)} = {\prod\limits_{C}{x_{C}^{v_{C},{SAT}}\gamma_{C}^{{*v_{C}},{SAT}}{\prod\limits_{A}{x_{A}^{v_{A},{SAT}}\gamma_{A}^{{*v_{A}},{SAT}}{\prod\limits_{M}{x_{M}^{SAT}\gamma_{M}^{SAT}}}}}}}},$ wherein: K_(sp) is the solubility product constant for the electrolyte, T is the temperature of the mixture, x_(C) ^(ν) ^(C) ^(SAT) is the mole fraction of a cation derived from the electrolyte at saturation point of the electrolyte, x_(A) ^(ν) ^(A) ^(SAT) is the mole fraction of an anion derived from the electrolyte at saturation point of the electrolyte, x_(C) ^(ν) ^(M) ^(SAT) is the mole fraction of a neutral molecule derived from the electrolyte at saturation point of the electrolyte, γ*_(C) ^(ν) ^(C) ^(,SAT) is the activity coefficient of a cation derived from the electrolyte at the saturation concentration; γ*_(A) ^(ν) ^(A) ^(SAT) is the activity coefficient of an anion derived from the electrolyte at the saturation concentration; γ*_(M) ^(ν) ^(M) ^(,SAT) is the activity coefficient of a neutral molecule derived from the electrolyte at the saturation concentration; C is the cation, A is the anion, M is one or more solvent or solute molecule, T is the temperature of the mixture, γ* is the unsymmetric activity coefficient of a species in solution, SAT is saturation concentration, υ_(C) is the cationic stoichiometric coefficient, υ_(A) is the anionic stoichiometric coefficient, and υ_(M) is the neutral molecule stoichiometric coefficient.
 40. The method of claim 39, wherein the solvent is water, and the step of computing at least one physical property includes calculating: ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)+ln γ*_(I) ^(FH), wherein: I is the ionic specie; ln γ*_(I) is the logarithm of an activity coefficient of I; ln γ*_(I) ^(lc) is the local composition term of I; ln γ*_(I) ^(PDH) is the Pitzer-Debye-Hückel term of I; and ln γ*_(I) ^(FH) is the Flory-Huggins term of I.
 41. The method of claim 39, wherein the one or more solvents include mixed-solvent solutions, and the step of computing at least one physical property including calculating: ln γ*_(I)=ln γ*_(I) ^(lc)+ln γ*_(I) ^(PDH)+ln γ*_(I) ^(FH)+Δ ln γ_(I) ^(Born), wherein: I is the ionic specie; ln γ*_(I) is the logarithm of an activity coefficient of I; ln γ*_(I) ^(lc) is the local interaction contribution of I; ln γ*_(I) ^(PDH) is the Pitzer-Debye-Hückel term of I; ln γ*_(I) ^(FH) is the Flory-Huggins term of I; and Δ ln γ_(I) ^(Born) is the Born term of I.
 42. The method of claim 34, wherein if the mixture includes a single electrolyte, the step of defining the segment number includes calculating: r _(c,C) =r _(e,CA) Z _(C) and r _(a,A) =r _(e,CA) Z _(A), wherein: r_(e) is the electrolyte segment number, r_(c) is the cationic segment number, r_(a) is the anionic segment number, where r_(c) and r_(a) satisfy electroneutrality; CA is an electrolyte, wherein C is a cation, and A is an anion; and Z_(C) is the charge number for the cation C, and Z_(A) is the charge number for the anion A; and if the mixture includes multiple electrolytes, the step of defining the segment number includes calculating: $r_{c,C} = {\sum\limits_{A}{r_{e,{CA}}{Z_{C}\left( {x_{A}{Z_{A}/{\sum\limits_{A^{\prime}}{x_{A^{\prime}}Z_{A^{\prime}}}}}} \right)}\mspace{14mu}{and}}}$ ${r_{a,A} = {\sum\limits_{C}{r_{e,{CA}}{Z_{A}\left( {x_{C}{Z_{C}/{\sum\limits_{C^{\prime}}{x_{C^{\prime}}Z_{C^{\prime}}}}}} \right)}}}},$ wherein: r_(e) is the segment number, r_(c) is the cationic segment number, r_(a) is the anionic segment number, where r_(c) and r_(a) satisfy electroneutrality; CA is an electrolyte, wherein C is a cation, and A is an anion; C′A′ is other electrolyte(s), wherein C′ is a cation and A′ is an anion; Z_(C) is a charge number for C, and Z_(A) is a charge number for A; Z_(C′) is a charge number for C′, and Z_(A′) is a charge number for A′; x_(A) is a mole fraction of A, and x_(C) is a mole fraction of C; and x_(A′) is a mole fraction of A′, and x_(C′) is a mole fraction of C′.
 43. A computer program product, comprising: a) a computer usable medium; and b) a set of computer program instructions embodied on the computer useable medium for conducting industrial manufacture, research or development by modeling at least one physical property of a mixture of at least two chemical species by the computer implemented steps of: aa) providing a modeler configured to be executable by a processor, the modeler during execution being formed of (i) a databank of molecular descriptors of known chemical species, and (ii) a calculator of molecular descriptors of unknown chemical species; bb) determining at least one conceptual segment, instead of a molecular structural segment, for each of the at least two chemical species, the conceptual segment being determined from in-mixture behavior of the at least two chemical species, including for each conceptual segment, (i) identifying the conceptual segment as one of a hydrophobic segment, a hydrophilic segment, a polar segment, or a combination thereof, and (ii) defining an equivalent number for the conceptual segment, the equivalent number being based on experimental phase equilibrium data and being one of carried in the databank of molecular descriptors of known chemical species or obtained using the calculator of molecular descriptors of unknown chemical species by regression of experimental phase equilibrium data for binary systems of unknown chemical species and reference chemical species; cc) providing the determined at least one conceptual segment to the modeler, and in response the modeler using the determined at least one conceptual segment to compute at least one physical property of the mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof, the modeler computing the at least one physical property by determining an activity coefficient of one of the at least two chemical species, the activity coefficient being formed of at least a residual contribution to the activity coefficient of the one chemical species, the modeler setting the residual contribution equal to a local composition interaction contribution to the activity coefficient for the one chemical species based on the determined at least one conceptual segment; dd) analyzing the computed at least one physical property using the modeler, in a comparison to the computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in conducting industrial manufacture, research or development; and ee) outputting the formed model from the modeler to a computer display monitor in a manner enabling the conducting of industrial manufacture, research or development.
 44. The computer program product of claim 43, wherein at least some portion of the computer program instructions include instructions to request data or request instructions over a telecommunications network.
 45. The computer program product of claim 43, wherein at least some portion of the computer program is transmitted over a global network.
 46. The computer program product of claim 43, wherein conducting industrial manufacture, research or development includes one or more of the following: a pharmaceutical activity, chromatography, product drying and cleaning activity in a manufacturing plant.
 47. The computer program product of claim 43, wherein conducting industrial manufacture, research or development includes one or more of the following: pharmacokinetics, pharmacodynamics, solvent screening, crystallization productivity, drug formulation, combination drug therapy, drug toxicity, a process design for an active pharmaceutical ingredient, capillary-action chromatography, paper chromatography, thin layer chromatography, column chromatography, fast protein liquid chromatography, high performance liquid chromatography, ion exchange chromatography, affinity chromatography, gas chromatography, and countercurrent chromatography.
 48. A method of conducting industrial manufacture, research or development, the method comprising: a) using a determined at least one conceptual segment and computing at least one physical property of a mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof; b) analyzing the computed at least one physical property using a modeler, in a comparison to a computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in conducting the industrial manufacture, research or development; and c) outputting the formed model from the modeler to a computer display monitor in a manner enabling the industrial manufacture, research or development.
 49. A method of conducting a pharmaceutical activity, the method comprising: a) using a determined at least one conceptual segment and computing at least one physical property of a mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof; b) analyzing the computed at least one physical property using a modeler, in a comparison to a computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in conducting the pharmaceutical activity; and c) outputting the formed model from the modeler to a computer display monitor in a manner enabling the pharmaceutical activity.
 50. A method of separating one or more chemical species from a mixture, the method comprising: a) using a determined at least one conceptual segment, computing at least one physical property of the mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof; b) analyzing the computed at least one physical property using a modeler, in a comparison to a computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in separating one or more chemical species from the mixture; and c) outputting the formed model from the modeler to a computer display monitor in a manner enabling separating one or more chemical species from the mixture.
 51. A computer program product, comprising: a) a computer usable medium; and b) a set of computer program instructions embodied on the computer useable medium for conducting industrial manufacture, research or development by modeling at least one physical property of a mixture of at least two chemical species by the computer implemented steps of: aa) receiving at a modeler a determined at least one conceptual segment, and responsively, the modeler using the determined at least one conceptual segment and computing at least one physical property of the mixture, including any one of vapor pressure, solubility, boiling point, freezing point, octanol/water partition coefficient, or a combination thereof; bb) analyzing the computed at least one physical property using the modeler, in a comparison to a computed at least one physical property of other mixtures of at least two chemical species, and forming therefrom a model of the at least one physical property of the mixture useable in conducting industrial manufacture, research or development; and cc) outputting the formed model from the modeler to a computer display monitor in a manner enabling the conducting of industrial manufacture, research or development. 